Skip to content
Simon Twigger edited this page Aug 17, 2010 · 4 revisions

MCW-RDF Project

Goals

The main aim of this project is to start creating code that converts some of the data at MCW into RDF so that it can be used elsewhere for interesting semantic-webby things. This is driven in part by our collaboration with the National Center for Biomedical Ontology where we are creating a lot of gene-to-tissue annotations that might work really well in RDF, integrated with other resources. With this in mind we’re starting to create some code to convert our existing public resources into RDF so that we can play with them and so that others can try it too. Here’s a list of what we’re working on:

  • RGD Genes data – converting the RGD GENES ftp file into RDF in n-triples format
  • RGD QTL data – converting the RGD QTL ftp file into RDF in n-triples format

Usage

  • To parse the RGD GENES FTP file found here: ftp://rgd.mcw.edu/pub/data_release/GENES_RAT
    • ruby rgdGenesToRDF.rb gene GENES_RAT > rgd_genes.rdf
  • To parse the RGD QTL FTP file found here: ftp://rgd.mcw.edu/pub/data_release/QTLS_RAT
    • ruby rgdGenesToRDF.rb qtl QTLS_RAT > rgd_qtls.rdf

To Do Items

  • Change the name of the rgdGenesToRDF.rb script as it does more than just genes!
  • Consider creating official RGD object ontology for objects in RGD
Clone this wiki locally