RubyGems

scrappy 0.1

Scrappy is a tool that extracts information from web pages and produces RDF data. It uses the scraping ontology to define mappings between HTML content and RDF data. The following example mapping extracts all titles from http://www.elmundo.es:

    dc: http://purl.org/dc/elements/1.1/
    rdf: http://www.w3.org/1999/02/22-rdf-syntax-ns#
    sioc: http://rdfs.org/sioc/ns#
    sc: http://lab.gsi.dit.upm.es/scraping.rdf#
    *:
      rdf:type: sc:Fragment
      sc:selector:
        *:
          rdf:type: sc:UriSelector
          rdf:value: "http://www.elmundo.es/"
      sc:identifier:
        *:
          rdf:type: sc:BaseUriSelector
      sc:subfragment:
        *:
          sc:type: sioc:Post
          sc:selector:
            *:
              rdf:type: sc:CssSelector
              rdf:value: ".noticia h2, .noticia h3, .noticia h4"
          sc:identifier:
            *:
              rdf:type: sc:CssSelector
              rdf:value: "a"
              sc:attribute: "href"
          sc:subfragment:
            *:
              sc:type: rdf:Literal
              sc:relation: dc:title
              sc:selector:
                *:
                  rdf:type: sc:CssSelector
                  rdf:value: "a"

(The mapping above is serialized in YARF, a format supported by the LightRDF gem. Mappings can also be defined in RDF/XML, JSON, or N-Triples.)
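
To make the effect of such a mapping concrete, here is a minimal sketch of the same idea written by hand: find headline links inside news items, use each link's href as the resource identifier, and emit a sioc:Post with a dc:title. This is not Scrappy's API. The sample markup, the extract_triples and ntriples_literal helpers, and the use of REXML with XPath (instead of the CSS selectors the mapping declares) are all assumptions made so the example runs with Ruby's bundled libraries and without fetching the real site.

```ruby
require "rexml/document"
require "uri"

# Hypothetical, well-formed stand-in for the page; the real site is not fetched.
SAMPLE_HTML = <<~HTML
  <html><body>
    <div class="noticia"><h2><a href="/a.html">First headline</a></h2></div>
    <div class="noticia"><h3><a href="/b.html">Second headline</a></h3></div>
    <div class="other"><h2><a href="/c.html">Not a news item</a></h2></div>
  </body></html>
HTML

BASE_URI  = "http://www.elmundo.es/"
RDF_TYPE  = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
SIOC_POST = "http://rdfs.org/sioc/ns#Post"
DC_TITLE  = "http://purl.org/dc/elements/1.1/title"

# Quote a string as a simple N-Triples literal (only backslash and
# double-quote are escaped; enough for this illustration).
def ntriples_literal(text)
  escaped = text.gsub("\\") { "\\\\" }.gsub('"') { "\\\"" }
  "\"#{escaped}\""
end

# Hand-written equivalent of the mapping: headings inside "noticia" blocks
# become sioc:Post resources identified by the link's href, titled by its text.
# XPath child steps approximate the CSS selector ".noticia h2, .noticia h3, .noticia h4".
def extract_triples(html, base_uri)
  doc = REXML::Document.new(html)
  triples = []
  %w[h2 h3 h4].each do |tag|
    REXML::XPath.each(doc, "//div[@class='noticia']/#{tag}/a") do |link|
      subject = URI.join(base_uri, link.attributes["href"]).to_s
      title   = link.text.to_s.strip
      triples << "<#{subject}> <#{RDF_TYPE}> <#{SIOC_POST}> ."
      triples << "<#{subject}> <#{DC_TITLE}> #{ntriples_literal(title)} ."
    end
  end
  triples
end

puts extract_triples(SAMPLE_HTML, BASE_URI)
```

The declarative mapping replaces exactly this kind of hand-written traversal: the selectors, the identifier rule, and the target RDF types live in data rather than in code.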

Versions:

  1. 0.4.10 - December 01, 2011 (52 KB)
  2. 0.4.9 - November 28, 2011 (52 KB)
  3. 0.4.8 - November 24, 2011 (52 KB)
  4. 0.4.7 - November 24, 2011 (52 KB)
  5. 0.4.6 - October 05, 2011 (52 KB)
  6. 0.1 - October 07, 2010 (17 KB)
Show all versions (44)

Runtime Dependencies (6):

activesupport >= 2.3.5
camping = 2.0
lightrdf >= 0.1
markaby >= 0.7.1
mechanize >= 1.0.0
nokogiri >= 1.4.1
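
As a rough illustration of how the constraints listed above are interpreted, the sketch below checks candidate versions against them using the Gem::Requirement and Gem::Version classes that ship with RubyGems. The RUNTIME_DEPENDENCIES constant and the satisfies? helper are hypothetical names introduced for this example; the constraint strings are copied from the list.

```ruby
require "rubygems"

# Constraints copied from the runtime dependency list above.
RUNTIME_DEPENDENCIES = {
  "activesupport" => ">= 2.3.5",
  "camping"       => "= 2.0",
  "lightrdf"      => ">= 0.1",
  "markaby"       => ">= 0.7.1",
  "mechanize"     => ">= 1.0.0",
  "nokogiri"      => ">= 1.4.1"
}.freeze

# Hypothetical helper: true when `version` meets the listed constraint
# for `gem_name`. Raises KeyError for a gem that is not in the list.
def satisfies?(gem_name, version)
  requirement = Gem::Requirement.new(RUNTIME_DEPENDENCIES.fetch(gem_name))
  requirement.satisfied_by?(Gem::Version.new(version))
end

puts satisfies?("nokogiri", "1.4.1") # meets the lower bound
puts satisfies?("camping", "2.1")    # exact pin "= 2.0" rejects other versions
```

Note that camping is the only dependency pinned to an exact version; the others accept any release at or above the stated minimum.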

Development Dependencies (2):

hoe >= 2.6.0
rubyforge >= 2.0.4

Authors:

  • Jose Ignacio

Total downloads: 136,374

For this version: 3,315

License:

none

Required Ruby version: None
