LSID Resolver for Analytical Models
Coordinator(s): Matt Jones
Participants: Matt Jones, Chad Berkley, and the Kepler team
Description
This group will make available an
LSID resolver that can serve information about analyses and models in the form of scientific workflows that might be used with ecological and biological and other scientific data.
Technical Information
URL for prototype user interface: http://library.kepler-project.org
LSID authority(ies): kepler-project.org
LSID namespace(s): actor, kar
Hardware platform: Dell rackmount server, 4GB RAM, 2.5Terabyte storage array
Server platform: RHEL4, postgres 7.4, Apache 2, Tomcat 5, Metacat 1.6
LSID Software stack used: Java
RDF/OWL ontology used for metadata: In development
Approximate number of LSIDs stored: 150
Benchmarchs:
Other resources:
Roadmap, Milestones, Timeline
* Spring 2006: Set up prototype analysis and model workflow repository and
LSID resolver
* Spring 2006: Modify Kepler to be able to publish models with
LSIDs to this repository
Discussion, Implementation Issues
In the Kepler scientific workflow system (
http://kepler-project.org), scientific analyses and models can be represented as workflows that model the flow of data from one processing step to another. Processing steps, called 'actors' in Kepler, can be as fine grained (e.g., simple addition) or coarse-grained (e.g., the GARP model) as desired by the scientist. Kepler models are published in the Modeling Markup Language (
MoML?) and are executable using the Kepler Scientific Workflow system. Within Kepler, each analytical component that is published or that ships with Kepler is assigned its own
LSID. The component is serialized in a Kepler archive (.kar) file that contains a manifest listing its
LSID and other metadata. The .kar file can be shipped between Kepler systems (e.g., via email or web, etc) and opened by other Kepler users, allowing people to share their analytical work. Combined with strongly-versioned data sets that are also referenced by
LSIDs in the models (See
LSIDResolverForObservations), the KAR file can represent a complete scientific model and all of the data need to execute that model.
Analytical models in Kepler that might be of more general use can be saved from Kepler and published to the Kepler actor library (
http://library.kepler-project.org), which makes them available for download and use by other Kepler users. Most of this system is currently functional in prototype form.
Future features include the ability to search the Kepler actor library from directly within the Kepler user interface and directly import and use analytical components found through this search. This is analogous to the current
EcoGrid data search that is available in Kepler for locating ecological data, observational data, specimen data, and geophysical data through the
EcoGrid web service interfaces.
Lessons Learned, Conclusions, Recommendations
Categories
CategoryWorkingGroup
CategoryPrototypingWG