RubyGems Navigation menu

scrappy 0.1

Scrappy is a tool that allows extracting information from web pages and producing RDF data. It uses the scraping ontology to define the mappings between HTML contents and RDF data. An example of mapping is shown next, which allows extracting all titles from http://www.elmundo.es: dc: http://purl.org/dc/elements/1.1/ rdf: http://www.w3.org/1999/02/22-rdf-syntax-ns# sioc: http://rdfs.org/sioc/ns# sc: http://lab.gsi.dit.upm.es/scraping.rdf# *: rdf:type: sc:Fragment sc:selector: *: rdf:type: sc:UriSelector rdf:value: "http://www.elmundo.es/" sc:identifier: *: rdf:type: sc:BaseUriSelector sc:subfragment: *: sc:type: sioc:Post sc:selector: *: rdf:type: sc:CssSelector rdf:value: ".noticia h2, .noticia h3, .noticia h4" sc:identifier: *: rdf:type: sc:CssSelector rdf:value: "a" sc:attribute: "href" sc:subfragment: *: sc:type: rdf:Literal sc:relation: dc:title sc:selector: *: rdf:type: sc:CssSelector rdf:value: "a" (The above code is serialized using YARF format, supported by LightRDF gem, as well as RDFXML, JSON, NTriples formats, which can also be used to define the mappings).

Gemfile:
=

install:
=

Versions:

  1. 0.4.10 December 01, 2011 (52 KB)
  2. 0.4.9 November 28, 2011 (52 KB)
  3. 0.4.8 November 24, 2011 (52 KB)
  4. 0.4.7 November 24, 2011 (52 KB)
  5. 0.4.6 October 05, 2011 (52 KB)
  6. 0.1 October 07, 2010 (17 KB)
Show all versions (44 total)

Runtime Dependencies (6):

activesupport >= 2.3.5
camping = 2.0
lightrdf >= 0.1
markaby >= 0.7.1
mechanize >= 1.0.0
nokogiri >= 1.4.1

Development Dependencies (2):

hoe >= 2.6.0
rubyforge >= 2.0.4

Owners:

Authors:

  • Jose Ignacio

SHA 256 checksum:

=

Total downloads 138,265

For this version 3,357

Version Released:

Licenses:

N/A

Required Ruby Version: None

Links: