SeedMeLab (formerly SeedMe2) is a set of modular web-based building blocks to enable research teams to quickly share, visualize, and discuss the results of experiments, simulations and analyses.
SeedMe = Stream, Encode, Explore and Disseminate My Experiments
Data is an integral part of scientific research, and data size problems have become endemic as computation and analyses are producing an increasingly large amount of data that research teams are inevitably tasked with managing these rapidly growing data collections. Existing solutions are largely focused upon providing storage space, whether local or in the cloud, and a familiar folder tree-style hierarchy. While these file system solutions work, they separate the data from essential contextual information, such as metadata, descriptive text and equations, job execution parameters, visualizations, and on-going data discussion among the researchers. Important discussions, for instance, remain in email logs or forums, while descriptive text is left in README files or embedded in those same email logs and forums. This distribution of contextual information makes it harder to keep track of it all and keep data from being orphaned or misinterpreted. A more unified approach is needed that keeps data and context together within the same storage system.
SeedMeLab project has resulted in the development of data sharing and data management. It enables research teams to manage, share, search, visualize, and present their data in a web-based environment using an access-controlled, branded, and customizable website they own and control. It supports storing and viewing data in a familiar tree hierarchy but also supports formatted annotations, lightweight visualizations, and threaded comments on any file/folder. The system can be easily extended and customized to support metadata, job parameters, and other domain and project-specific contextual items. The software is open source and available as an extension to the popular Drupal content management system
This project is addressing cyberinfrastructure weaknesses by creating new building blocks to support rapid web-based data sharing, granular access controls, light-weight microformats for scientific data, and brief web-based visualizations to enable quick looks at transient data and preliminary results from HPC jobs, experiments, etc. These building blocks are cast as general-purpose APIs and Drupal-specific modules that may be used to quickly stand up web-based project-, lab-, and department-based data sharing services.
SeedMeLab is build upon Drupal content management system. It has a number of modular components that interoperate with each other to provide a host of features.
- Amit Chourasia, Principal Investigator
- Michael Norman, Co-Principal Investigator
- David Nadeau, Technical Architect
- Andrew Ferbert, Systems Administrator
- Michael Dwyer, Database Administrator
This work is supported by the National Science Foundation under Grant No. 1443083. "Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation."
We would like to express our gratitude to the Science Gateways Community Institute for providing support in refining value proposition, outreach strategy development and usability study.