Unable to connect to database - 14:14:45 Unable to connect to database - 14:14:45 SQL Statement is null or not a SELECT - 14:14:45 SQL Statement is null or not a DELETE - 14:14:45 Botany & Mycology 2009 - Abstract Search
Unable to connect to database - 14:14:45 Unable to connect to database - 14:14:45 SQL Statement is null or not a SELECT - 14:14:45

Abstract Detail


The Challenges Posed by Large Data Sets in Assembling the Plant and Fungal Trees of Life

Cellinese, Nico [1], Beaman, Reed S. [2].

TOLKIN challenges: Handling large collaborative data in real-time.

TOLKIN provides information management for collaborative phylogenetic systematics research through a web-accessible interface. Core modules support taxonomic, tree-based hierarchies that are linked to molecular data (nucleotide and full-genome sequences), morphological observations, collections, and library citations. By designing TOLKIN as a web application rather than a personal desktop application, we are able to support distance collaboration in real-time. For example, there is a need for compiling large molecular or morphological data matrices that are inclusive of all data generated by multiple labs. In a web application, users are able to modify data and create new matrices and collaborators can immediately take advantage of the updated dataset. Based on lessons learned from TOLKIN Version 1, we have focused efforts in five key areas: 1) designing a more usable, friendly interface that gives project managers and users more control over their project data; 2) better support for import and export functions (e.g., fasta and nexus file formats) and automated data-mining for Genbank sequences; 3) data analysis support through Kepler scientific workflows and/or direct access to data analysis portals; 4) allowing users to document data and observations through citations, images and other media, semantic tagging, and hyperlinks to external resources; and 5) establishing a system of provenance for molecular and morphological data, by support for matrix subsets and versioning for matrices, character and character state observations, and sequences and alignments. Project users are provided access to their data through the portal at www.tolkin.org. TOLKIN development has been primarily funded through NSF AToL and PBI programs.


Log in to add this item to your schedule

Related Links:
http://www.tolkin.org


1 - University of Florida, Florida Museum of Natural History, Dickinson Hall, PO Box 117800, Gainseville, FL, 32611, USA
2 - Florida Museum of Natural History, Dickinson Hall, P.O. Box 117800, Gainesville, Florida, 32611, USA

Keywords:
none specified

Presentation Type: Symposium or Colloquium Presentation
Session: SY5
Location: Ballroom 2/Cliff Lodge - Level B
Date: Tuesday, July 28th, 2009
Time: 9:00 AM
Number: SY5003
Abstract ID:720