RubyGems Navigation menu

scrappy 0.1

Scrappy is a tool that allows extracting information from web pages and producing RDF data. It uses the scraping ontology to define the mappings between HTML contents and RDF data. An example of mapping is shown next, which allows extracting all titles from http://www.elmundo.es: dc: http://purl.org/dc/elements/1.1/ rdf: http://www.w3.org/1999/02/22-rdf-syntax-ns# sioc: http://rdfs.org/sioc/ns# sc: http://lab.gsi.dit.upm.es/scraping.rdf# *: rdf:type: sc:Fragment sc:selector: *: rdf:type: sc:UriSelector rdf:value: "http://www.elmundo.es/" sc:identifier: *: rdf:type: sc:BaseUriSelector sc:subfragment: *: sc:type: sioc:Post sc:selector: *: rdf:type: sc:CssSelector rdf:value: ".noticia h2, .noticia h3, .noticia h4" sc:identifier: *: rdf:type: sc:CssSelector rdf:value: "a" sc:attribute: "href" sc:subfragment: *: sc:type: rdf:Literal sc:relation: dc:title sc:selector: *: rdf:type: sc:CssSelector rdf:value: "a" (The above code is serialized using YARF format, supported by LightRDF gem, as well as RDFXML, JSON, NTriples formats, which can also be used to define the mappings).

Gemfile:
= Copiar al portapapeles ¡Copiado!

instalar:
=

Versiones:

  1. 0.4.10 - December 01, 2011 (52 KB)
  2. 0.4.9 - November 28, 2011 (52 KB)
  3. 0.4.8 - November 24, 2011 (52 KB)
  4. 0.4.7 - November 24, 2011 (52 KB)
  5. 0.4.6 - October 05, 2011 (52 KB)
  6. 0.1 - October 07, 2010 (17 KB)
Mostrar todas las versiones (44 total)

dependencias de Runtime (6):

activesupport >= 2.3.5
camping = 2.0
lightrdf >= 0.1
markaby >= 0.7.1
mechanize >= 1.0.0
nokogiri >= 1.4.1

dependencias de Development (2):

hoe >= 2.6.0
rubyforge >= 2.0.4

Propietarios:

Autores:

  • Jose Ignacio

SHA 256 checksum:

= Copiar al portapapeles ¡Copiado!

Total de descargas 136.374

Para esta versión 3.315

Licencias:

N/A

Versión de Ruby requerida: Ninguno

Enlace: