Anything to Triples -- any23
http://code.google.com/p/any23/
Any23 is an open-source Java library that parses structured data out
of various Web document formats, and maps them into the RDF data
model. Any23 is based on code that has been developed for the Sindice
search engine at DERI. This is the initial release. It supports the
following input formats:
* RDF/XML
* Turtle (Notation 3)
* N-Triples
* RDFa embedded in XHTML and HTML
* Microformats: Adr, Geo, hCalendar, hCard, hListing, hResume,
hReview, License, XFN
The focus of this first release is to extract the code from the
Sindice codebase, and getting project infrastructure into place.
Future versions will focus on a more flexible API, improved
performance, more input data formats, higher-quality extraction, and
more output formats such as JSON.
Any23 can be downloaded from the project website:
http://code.google.com/p/any23/
Feedback is very welcome on the project mailing list:
http://groups.google.com/group/any23-dev
Best,
Richard