/facet (pronounced "slashfacet") is a generic browser for
heterogeneous semantic web repositories. The browser works on any
RDFS dataset without any additional configuration. It has some
unique features.

The ACE Program is developing extraction technology to support
automatic processing of natural text, including classification,
filtering, and selection, with specific emphasis on the detection
and characterization of Entities, Relations, and Events

AceWiki is a semantic wiki that is powerful and at the same time
easy to use. Making use of the controlled natural language ACE, the
formal statements of the wiki are shown in a way that looks like
natural English. In order to help the users to write correct ACE
sentences, AceWiki provides a predictive editor

Acre is an open development environment for developing web apps
using Javascript. Acre is the backend for the Freebase Apps
platform, which provides hosted, social app development using
server-side Javascript. Acre can be obtained as an open source
release or can be used online at http://acre.freebase.com

ActiveRDF is a library for accessing RDF data from Ruby programs.
It can be used as data layer in Ruby-on-Rails. You can address RDF
resources, classes, properties, etc. programmatically, without
queries

The Aduna Metadata Server automatically extracts metadata from
information sources, like a file server, an intranet or public web
sites. The Aduna Metadata Server is a powerful and scalable store
for metadata

The Adaptive Information Disclosure (AID) project is part of a
larger effort that aims at the creation of a so-called Virtual Lab
environment for e-science . AID includes a suite of dynamic,
model-driven information and knowledge extraction tools on top of
an architecture for grid-based distributed data analysis, slated
for completion in 2008. The AIDA Toolkit is based off of Taverna

Algae is an RDF query language used in the W3C Annotea Server.
Algae can be used to query a graph, insert data into a graph, or
write rules to automatically insert data when a query is matched.
Because of this flexibility, Algae has also served as a research
platform for studies in interfacing with relational databases and
system information

The Alignment API is an API and implementation for expressing and
sharing ontology alignments. The Alignment API expresses alignments
in a uniform RDF way. The Alignment API itself is a Java
description of tools for accessing the common format. There are
many plug-ins for the API.

The Assisted Linked Data Consumption Engine (ALOE) aids in the
consumption from and fusion of Linked Data sources. ALOE achieves
this goal by discovering class and property mappings across
endpoints even when no schema information is available. ALOE also
provides several functions for transforming the data from the
source knowledge base into a format that corresponds to that of the
target knowledge base

The AMALGAM (Automatic Mapping Among Lexico-Grammatical Annotation
Models) project is an attempt to create a set of mapping algorithms
to map between the main tagsets and phrase structure grammar
schemes used in various research corpora. Software has been
developed to tag text with up to 8 annotation schemes. May be
abandoned.

Ambra is an innovative Open Source platform for publishing Open
Access research articles. It provides features for post-publication
annotation and discussion that allows for a "living" document
around which further scientific discoveries can be made. Ambra uses
a semantic repository which provides a way to store all content as
RDF triples

Amine is a Multi-Layer Platform implemented in Java. It provides
various Engines and GUIs to build a wide variety of Ontology-based
applications, Conceptual Graph based applications, Intelligent
Systems and Multi-Agents Systems

AnnoCultor Converter allows converting SQL databases, XML files,
and SPARQL datasets to RDF. Converters are written in XML in a
simple declarative way, and common XML editing skills are
sufficient to write one

AnnoCultor Tagger allows assigning semantic tags to your data. Here
the tags correspond to terms from existing vocabularies, e.g. term
'Paris' from Geonames contains translations in a dozen languages,
augmented with coordinates, population, and other data. AnnoCultor
can find and link this tag to a document, allowing new features,
such as showing this document on a map. AnnoCultor is suitable for
any vocabulary

Metafy's Anthracite Web Mining Desktop toolkit gives you the tools
you need to build powerful data processing systems with an
easy-to-use visual interface that makes complex manipulations
quickly possible. Anthracite is built for people who need to
transform internet sources and/or large data sets into integrated
information quickly and easily without scripting. Mac OS X only

AFS (Antidot Finder Suite) is an enterprise approach to information
search that is based on software agents. Antidot's approach is
built on "intelligent agents" dedicated to each source, where each
AFS agent is configured for the environment, access constraints,
format and structure of each data source. Agents may be developed
for websites, file servers, e-mail servers, archives, databases,
directories, XML streams, digital catalogs, etc. AFS also employs
semantic considerations of the search request.

Anything To Triples (any23) is a library and web service that
extracts structured data in RDF format from a variety of Web
documents. Currently it supports the following input formats:
RDF/XML, Turtle, Notation 3, RDFa; and the microformats ofAdr, Geo,
hCalendar, hCard, hListing, hResume, hReview, License, XFN

Apolda (Automated Processing of Ontologies with Lexical Denotations
for Annotation) is a plugin (processing resource) for GATE
(http://gate.ac.uk/). The Apolda processing resource (PR) annotates
a document like a gazetteer, but takes the terms from an (OWL)
ontology rather than from a list.

AquaBrowser is a faceted browser specifically designed for
libraries and works off of its own local content index. Faceted
search and graphical relatedness dispalys are included. Interfaces
are quite attractive

Aqueduct is a linked data semantic Web extension for Mediawiki that
provides widgets that can be placed on wiki pages to structure and
visualize semantic data, link to outside semantic datasources,
discover data in semantic datasources, and perform queries. Unlike
Semantic Mediawiki, Aqueduct is designed to work with semantic
(RDF) datasets that are external to the wiki

Ariel is a Ruby library that allows you to extract information from
semi-structured documents (such as websites). It is different to
existing tools because rather than expecting the developer to write
rules to extract the desired information, Ariel will use a small
number of labeled examples to generate and learn effective
extraction rules

ASMOV, which stands for Automated Semantic Mapping of Ontologies
with Validation, is an automatic ontology matching tool which has
been designed in order to facilitate the integration of
heterogeneous systems, using their data source ontologies.

ATLAS (Architecture and Tools for Linguistic Analysis Systems) is a
joint initiative of NIST, MITRE and the LDC to build a general
purpose annotation architecture and a data interchange format. The
starting point is the annotation graph model, with some significant
generalizations. Download now available from jATLAS -
http://sourceforge.net/projects/jatlas/

BAAGZ is plug-in assisted online service that brings together
search and social networking. BAAGZ uses its semantic description
of every Web site stored in localized repositories to determine
what that "baag" is about whether or not a given Web site is
specifically tagged by a user. The system then uses this knowledge
for search and retrieval. Baagz is currently available for Firefox
2+ only

Banach is a collection of operators that work on RDF graphs to
infer, extend, emerge or otherwise transform a graph into another.
You can think of it as a transformation pipeline for RDF with a
collection of implemented commands.

BaseVISor is a forward-chaining inference engine specialized to
handle facts in the form of RDF triples with support for RuleML,
R-Entailment and XML Schema Datatypes. BaseVISor is a Java
application and API.

Bibster is a Java-based semantics system which assists researchers
in managing, searching, and sharing bibliographic metadata (e.g.
from BibTeX files) in a peer-to-peer networked bibliographic
peer-to-peer system

Bigdata is a horizontally scaled open source architecture for
indexed data with an emphasis on semantic web data architectures.
Bigdata operates in both a single machine mode (Journal) and a
cluster mode (Federation)

Blacklight is an open source OPAC (online public access catalog).
That means libraries (or anyone else) can use it to allow people to
search and browse their collections online. Blacklight uses Solr to
index and search, and it has a highly configurable Ruby on Rails
front-end.

The Boost Graph Library is a generic interface that allows access
to a graph's structure, but hides the details of the
implementation. This is an ``open'' interface in the sense that any
graph library that implements this interface will be interoperable
with the BGL generic algorithms and with other algorithms that also
use this interface. The BGL provides some general purpose graph
classes that conform to this interface

A description logic (DL) reasoner that can be integrated into
Sesame. BOR is a SHQ(D) reasoner (SHQ(D) is SHOQ(D) without
nominals). It provides comprehensive DAML+OIL support and is
compliant with the latest OWL specifications. For performance
reasons, it is recommended to choose OWLIM instead

BorderFlow is a general-purpose graph clustering algorithm. It uses
solely local information for clustering and achieves a soft
clustering of the input graph. When considering a graph as the
description of a flow system, a cluster X can be understood as a
set of nodes such that the flow within X is maximal while the flow
from X to the outside is minimal. The idea behind BorderFlow is to
maximize the flow from the border of each cluster to its inner
nodes (i.e., the nodes within the cluster) while minimizing the
flow from the cluster to the nodes outside of the cluster

The BrownSauce RDF browser is a project to aggregate and present
arbitrary RDF data in as pleasing a manner as possible, that is a
'semantic web browser'. Brownsauce is a local http server; however
it should be trivial to add other front-ends

Bubastis is a tool for detecting asserted logical differences
between two ontologies, such as between versions. A stand alone
version of the tool is also available for download from the EFO
tools page. Bubastis is powered by the OWL-API

Caboto is an RDF-based Web annotations system to allow people to
annotate events and other things. The Caboto project was setup to
create a collaborative effort to fulfill the requirements of
Collaborative Research on the Web (CREW) - University of Bristol
and University of Manchester; Semantic Tools for Screen Arts
Research (STARS) - University of Bristol; and Integration Project
(CIP) - University of Bristol

Open Calais is free limited API web service to automatically attach
semantic metadata to content, based on either entities (people,
places, organizations, etc.), facts (person ‘x’ works for company
‘y’), or events (person ‘z’ was appointed chairman of company ‘y’
on date ‘x’). The metadata results are stored centrally and
returned to you as industry-standard RDF constructs accompanied by
a Globally Unique Identifier (GUID). It is an update of the earlier
ClearForest SWS

The Cashew project concerns the description and composition of
semantic web-services, unlike most efforts from the semantic web
community, the primary flavour of semantics concentrated on is
behavioural semantics. The Cashew project is inspired by both OWL-S
and WSMO, as well as by BPEL, Workflow Patterns and UML.

Content-Based Cross-Site Mining (CCM) of Web Data Records algorithm
combines techniques of extracting data records based on the
structure of documents (HTML tags) with an analysis of the
semantics of the content for better data record extraction

CEL is the first reasoner for the description logic EL+, supporting
as its main reasoning task the computation of the subsumption
hierarchy induced by EL+ ontologies. The most distinguishing
feature of CEL is that, unlike other modern DL reasoners, it
implements a polynomial-time algorithm

Charon is a framework for building RDFizing proxies that are
intended to wrap around existing web sites, screen-scrape their
HTML output and provide an RDF representation of that data. It
requires Apache Cocoon ver. 2.2. Older project no longer supported

Chimaera is a software system that supports users in creating and
maintaining distributed ontologies on the web. Two major functions
it supports are merging multiple ontologies together and diagnosing
individual or multiple ontologies.

ClearTK provides a framework for developing statistical natural
language processing components in Java and is based on the Apache
UIMA framework for text analysis. It includes an extraction
library, common interface and wrappers for machine learning
libraries, infrastructure for creating NLP components such as
sequential taggers, chunkers, syntactic parsers, semantic role
labeling, temporal resolution, etc., and wrappers for common NLP
components. such as the Snowball stemmer and OpenNLP syntactic
parser.

C-Link is search tool for finding related and possibly unknown
concepts that lie on a path between two known concepts. A demo of
C-Link available here is set up to search Wikipedia with the
ability to export your search to CMap. Based on Silverlight.

A platform for semantic Web applications with a SeRQL/SPARQL query
engine, interface to the The Yahoo! User Interface Library (YUI)
and libraries that support semantic search. The platform combines a
high performance in-core RDF store with flexible reasoning in
Prolog, query optimization. Prolog's interactive usage and
capabilities of recompiling modified source code while the system
remains alive greatly speedup development

CMS (CROSI Mapping System) is a structure matching system that
capitalizes on the rich semantics of the OWL constructs found in
source ontologies and on its modular architecture that allows the
system to consult external linguistic resources

CodeMirror is a JavaScript library that can be used to create a
relatively pleasant editor interface for code-like content ?
computer programs, HTML markup, and similar. If a mode has been
written for the language you are editing, the code will be
coloured, and the editor will optionally help you with indentation

COE (CmapTools Ontology Editor) is a specialized version of the
CmapTools from IMHC. COE -- and its CmapTools parent -- is based on
the idea of concept maps. A concept map is a graph diagram that
shows the relationships among concepts. Concepts are connected with
labeled arrows, with the relations manifesting in a
downward-branching hierarchical structure. COE is an integrated
suite of software tools for constructing, sharing and viewing OWL
encoded ontologies based on these constructs.

COEUS is a next-generation semantic Web-powered knowledge
management framework. It is targeted at rapid application
deployment of new applications in any research field, supported by
a comprehensive ontology and RDF-based configuration files

CognitionSearch , with beta examples for wikis (Wikipedia), health,
case studies, and government, is a lininguistic search technology.
It uses semi-supervised word disambiguation and a verly large
dictionary

COLANUT (Complex Linking in a NUTshell) interface implements
time-efficient schema matching algorithms that allow LIMES to
discover and suggest initial class and properties matchings for
linking. The whole is embedded in an easy-to-use GUI that allows
you to create link specifications easily and download them as XML
files or simply to run them online

Collaborative Protégé is a plug-in extension of the existing
Protégé system that supports collaborative ontology editing as well
as annotation of both ontology components and ontology changes. In
addition to the common ontology editing operations, it enables
annotation of both ontology components and ontology changes. It
supports the searching and filtering of user annotations, also
known as notes, based on different criteria. There is also an
online demo (http://smi-protege.stanford.edu/collab-protege/)

Collex is the social-software component of NINES, a collections and
exhibits builder that operates with NINES peer-reviewed resources.
With Collex, you can collect digital objects, and annotate and tag
them. Coming in 2007 is a custom online exhibit builder

CKS (Community Knowledge Services) is an application that
supplements the AFS (Antidot Finder Suite) search engine by
providing functions for monitoring, reorganizing, sharing and
delivering information. CKS lets each user reorganize and annotate
the information according to his own requirements, without having
to modify the underlying applications and data sources. Annotation
and collaboration is based on a folder metaphor

conStruct SCS is a structured content system that extends the basic
Drupal content management framework. conStruct enables structured
data and its controlling vocabularies (ontologies) to drive
applications and User Interfaces (semantic). conStruct provides
Drupal-level CRUD (create - read - update - delete), data display
templating, faceted browsing, full-text search, and import and
export over structured data stores based on RDF. Depending on roles
and permissions, a given user may or may not see specific datasets
or tools within the Drupal interface. Collaboration networks can
readily be established across multiple installations and non-Drupal
endpoints.

Still under formulation (Danny Ayers): A Converter from RDF is a
tool which converts RDF into an application-specific format for use
with existing tools and integration with other data. Typically this
will appear as part of a running system which provides a
domain-specific facet view of a given RDF application's data

Converter to RDF is a tool which converts application data from an
application-specific format into RDF for use with RDF tools and
integration with other data. This site is a listing of RDF
converters; see also the RDFizers list below

ConWeaver comprises modules for the extraction and integration of
information as well as search. Information is extracted from
distributed, heterogeneous data sources and represented in a
multilingual semantic knowledge network. ConWeaver is also able to
associate and classify documents with similar content or formal
similarities.

Conzilla2 is a second generation concept browser and knowledge
management tool with many purposes. It can be used as a visual
designer and manager of RDF classes and ontologies, since its
native storage is in RDF. It also has an online collaboration
server.

CORDER (COmmunity Relation Discovery by named Entity Recognition)
is an un-supervised machine learning algorithm that exploits named
entity recognition and co-occurrence data to associate individuals
in a community with their expertise and associates.

Corese stands for Conceptual Resource Search Engine. It is an RDF
engine based on Conceptual Graphs (CG) and written in Java. It
enables the processing of RDF Schema and RDF statements within the
CG formalism, provides a rule engine and a query engine accepting
the SPARQL syntax

Crowbar is a web scraping environment based on the use of a
server-side headless mozilla-based browser. It is used as a
research prototype to investigate how to enable the running of
Piggy Bank JavaScript scrapers from the command line and thus
automating web sites scraping.

csv2rdf4lod is a quick and easy way to produce an RDF encoding of
data available in Comma-Separated-Values (CSV). In its advanced
form, csv2rdf4lod is a custom reasoner tailored for more demanding
data integration. It can handle handle tabular data from
well-structured RDBMS dumps, but is designed specifically to handle
manually created data or data in the wild

CubicWeb is a semantic web application framework to efficiently
build Web applications by reusing components (called cubes) and
following object-oriented design principles. It main features are:
1) an engine driven by the explicit data model of the application;
2) a query language named RQL similar to SPARQL; 3) a selection and
view mechanism for semi-automatic XHTML/XML/JSON/text generation;
4) a library of reusable components

Cuebee is a flexible, extensible application for querying the
semantic Web. It provides a query builder interface to guide users
through the process of formulating complex queries. No technical
knowledge of query languages or the semantic Web is required. They
key enabler of the query builder is the ontology schema. A query
term can be a variable, instance or class in the ontology. The
system encodes the user query into SPARQL and submits to a server.
Cuebee is able to plug-and-play with any SPARQL Protocol compliant
server such as Joseki, Virtuoso, D2R-Server, etc.

CumulusRDF provides a simple HTTP interface to manage RDF data
stored in an Apache Cassandra cluster. It supports Linked Data and
triple pattern lookups. It can also act as a proxy server for other
Linked Data applications, allowing it to deploy any RDF dataset as
Linked Data

This is a listing of more than a dozen XSL transformations for
embedded HTML dialects into RDF using GRDDL. GRDDL is a technique
for using XML/XHTML dialects (especially microformats) as custom
RDF syntaxes by having each document point, directly or indirectly,
to a transformation to an RDF graph. RDFa is a design for mixing
RDF syntax into HTML. GRDDL accomodates a wider variety of dialects
at the expense of asking consumers to execute potentially untrusted
code. RDFa allows one parser to work for data from a variety of
domains and provides a direct relationship between the RDF data and
the HTML document structure, which provides better support for
copy-and-paste.

The Closed World Machine (CWM) data manipulator, rules processor
and query system mostly using using the Notation 3 textual RDF
syntax. It also has an incomplete OWL Full and a SPARQL access. It
is written in Python

Cytoscape is a large-scale graph visualization platform developed
within the biology community, but which is applicable to any
standard graph. It is based in part on GINY and Piccolo, and is
extendable with specific graph analysis and visualization
extensions

D2R MAP is a declarative language to describe mappings between
relational database schemata and OWL ontologies. This D2R processor
implements the D2R mapping language and exports data from a
relational database as RDF, N3, N-TRIPLES or as Jena models

D2R Server is a tool for publishing relational databases on the
Semantic Web. The server enables RDF and HTML browsers to navigate
the content of non-RDF databases, and allows applications to query
the database using the SPARQL query language

Dannotate is an Annotea compatible annotation client which uses
"Bookmarklets" to provided the ability to annotate selected text of
arbitrary web pages without the requirement to install a browser
plug-in

Dapper is a service that allows you to extract and use information
from any website on the Internet. Dapper acts as a Web service and
API maker, enabling you to build web applications and mashups using
data from any website without any programming.

DARQ is a query engine for federated SPARQL queries. It provides
transparent query access to multiple, distributed SPARQL endpoints
as if querying a single RDF graph. DARQ enables the applications to
see a single query interface, leaving the details of federation to
the query engine.

Install this script on your PHP+MySQL server to create your a space
for publishing web content within DBin (e.g a picture attachment or
your public key). Installing your own publishing service (instead
of using our convenience service, will give you the greatest
freedom and control over the material you decide to publish.

Data Mashups is an online service for small enterprises and
workgroups. It provides: 1) personalized portals (enterprise start
pages); 2) custom, composite business views and applications. Data
sources may include enterprise data, web services, web widgets, and
other web applications.

DataWiki is a wiki for structured data. It is usable to create and
edit structured data, create simple mashup applications, share
information, and enable easy input/output from a variety of
endpoints. It also has been launched on
http://datawiki.googlelabs.com/.

DBin brings the Semantic Web to the end users. By joining P2P
groups and communities, users can annotate any topic or subject of
interest and enjoy browsing and editing in a semantically rich
environment.

DBpedia Spotlight is a tool for annotating mentions of DBpedia
resources in text, providing a solution for linking unstructured
information sources to the Linked Open Data cloud through DBpedia.
DBpedia Spotlight can be used for Named Entity Recognition, Name
Resolution, amongst other information extraction tasks

DeepaMehta is a "networked semantic desktop" that replaces the
traditional computer desktop. DeepaMehta enables users to arrange
information of any kind and origin into topic maps. There also
graph interfaces and other differences from a standard desktop

Deki Wiki is a free open source wiki and application platform for
communities and enterprises. Deki Wiki is an easy to use and
sophisticated wiki for authoring, aggregating, organizing, and
sharing content. Deki Wiki is also a platform for creating
collaborative applications, or adding wiki capabilities to existing
applications. It has a complete application programming interface
(API) for programmers and extending Deki Wiki can be done in any
programming language

The Description Logic Complexity Navigator is a Web page that
allows users to review the complexity of reasoning tasks of various
description logics by adding or removing features. It includes a
comprehensive list of references to the literature

DiamondWiki is an experimental wiki based on the idea of faceted
navigation. Each wiki page is characterized by a list of attributes
such as subject or author, with these attributes used to construct
a dynamic directory of all the pages

The Disco - Hyperdata Browser is a simple browser for navigating
the Semantic Web as an unbound set of data sources. The browser
renders all information, that it can find on the Semantic Web about
a specific resource, as an HTML page; it is also available as a
Firefox bookmarklet

The Distiller can be used to read existing Web resources, or data
from an input form and generate the output in an alternative
format. It is based on the rdf.rb gem project. A very large number
of input and output formats are supported.

Django-RDF is an RDF engine implemented in a generic, reusable
Django app, providing complete RDF support to Django projects
without requiring any modifications to existing framework or app
source code, or incurring any performance penalty on existing
control flow paths.

DL-Learner is a tool for learning complex classes from examples and
background knowledge. It extends Inductive Logic Programming to
Description Logics and the Semantic Web. DL-Learner now has a
flexible component based design, which allows to extend it easily
with new learning algorithms, learning problems, reasoners, and
supported background knowledge sources. A new type of supported
knowledge sources are SPARQL endpoints, where DL-Learner can
extract knowledge fragments, which enables learning classes even on
large knowledge sources like DBpedia, and includes an OWL API
reasoner interface and Web service interface.

DOAPspace spiders various open source package indexes and and
creates DOAP profiles, with more than 45,000+ now available. Also
has an online DOAP validator. The DOAP input is obtained over Atom
feeds for new software releases from SourceForge, Python Package
Index and Freshmeat

DogmaModeler is a free and open source, ontology modeling tool
based on ORM. The philosophy of DogmaModeler is to enable non-IT
experts to model ontologies with a little or no involvement of an
ontology engineer

Doozer is an on-demand domain hierarchy creation application that
is available as a Web service. It extracts a domain model from
Wikipedia using its category hierarchy, link graph and concept
descriptors. Inputs are a seed concept and optional broader domain
concept

dotNetRDF is a .Net library using the latest versions of the .Net
framework to provide a powerful and easy to use API for working
with RDF. The system has support for various triplestores and SQL
databases.

DRAGO reasons across multiple distributed ontologies interrelated
by pairwise semantic mappings, with a vision of peer-to-peer
mapping of many distributed ontologies on the Web. It is
implemented as an extension to an open source Pellet OWL Reasoner.

DReW is a prototype DL reasoner over LDL+ ontologies and a
prototype reasoner for dl-programs over LDL+ ontologies under
well-founded semantics. It is not well developed or documented; it
can be downloaded
(http://www.kr.tuwien.ac.at/research/systems/drew/download.html)

DSpace is an open source digital asset managment software platform
that enables institutions to capture and describe digital content.
It runs on a variety of hardware platforms and supports OAI-PMH
version 2.0.

Dydra is a cloud-based graph database. Data is updated and accessed
via SPARQL via a RESTful API. The website is built with nginx, Ruby
and Ruby on Rails, relying heavily on the RDF.rb framework. The
back-end infrastructure is based on core messaging technologies
AMQP and Redis. Dydra uses a proprietary distributed SPARQL query
engine called SPOCQ that is built in Lisp

ED is the Entity Describer, a mashup of the Connotea social tagging
system, an index of semantic web-accessible controlled
vocabularies, and a new public RDF database for storing social
semantic annotations. It can be applied to other controlled
vocabularies, other social tagging sites and is based on
Greasemonkey scripts in Firefox.

Elda is a Java implementation of the Linked Data API provided by
Epimorphics. It comes with some pre-built examples which allow you
to experiment with the style of query and get started with building
your own specs.

ELK is an open source reasoner for the lightweight ontology
language OWL 2 EL. It is based on Java and can be controlled using
the OWL API, Protégé, the Snow Owl ontology editor, or a basic
command line interface

Elmo provides a simple API to access ontology oriented data inside
a Sesame RDF repository. The domain model is simplified into
independent concerns that are composed together for
multi-dimensional, inter-operating, or integrated applications.

EntityCube is a research prototype for exploring object-level
search technologies, which automatically summarizes the Web for
entities (such as people, locations and organizations) with a
modest web presence

Erca is a framework that eases the use of Formal and Relational
Concept Analysis, a neat clustering technique. Though not strictly
an ontology tool, Erca could be implemented in a work flow that
allows easy import of formal contexts from CSV files, then
algorithms that computes the concept lattice of the formal contexts
that can be exported as dot graphs (or in JPG, PNG, EPS and SVG
formats). Erca is provided as an Eclipse plug-in

Euler is an inference engine supporting logic based proofs. It is a
backward-chaining reasoner enhanced with Euler path detection. It
has implementations in Java, C#, Python, JavaScript and Prolog. Via
N3 it is interoperable with W3C Cwm

Euler is an inference engine supporting logic based proofs. It is a
backward-chaining reasoner enhanced with Euler path detection. It
has implementations in Java, C#, Python, JavaScript and Prolog. Via
N3 it is interoperable with W3C Cwm

Exhibit is a lightweight structured data publishing framework that
lets you create web pages with support for sorting, filtering, and
rich visualizations by writing only HTML and optionally some CSS
and JavaScript code. Its internal data representation is in RDF and
its interchange format is JSON; Babel is its translation
complement.

Explorator is a tool for exploring RDF data by direct manipulation.
Explorator’s web interface allows users to explore a
semi-structured RDF database to both gain knowledge and answer
specific questions about a domain, through browsing, search, and
exploration mechanisms

FactForge is a consolidated access point to the Web of data. It
allows users to find resources and facts based on the semantics of
the data, like web search engines index WWW pages and facilitate
their usage. It is based on resources in the Linking Open Data
(LOD) cloud. It provides efficient mechanisms to query data from
multiple datasets and sources, considering their semantics.
FactForge is designed also as a use-case for large-scale reasoning
and data integration and is based on the OWLIM semantic repository.

Factual is a platform where anyone can share and mash open data on
any subject. Factual is free to re-use, analyze, download,
manipulate, build upon, and mash with other tables. Developers can
get results via an API in either JSON or XML. Factual also has some
powerful Web extractors.

Falcon-AO (Finding, aligning and learning ontologies) is an
automatic ontology matching tool that includes the three elementary
matchers of String, V-Doc and GMO. In addition, it integrates a
partitioner PBM to cope with large-scale ontologies.

Falcons is a search engine for the Semantic Web that offers
keyword-based and concept search. Results are provided with labels,
types, provenance and usage information. An "Entity Browse"
interface also enables users to directly browse an entity with a
known URI reference.

The Fascinator is a software platform for developing
repository-based solutions. The Fascinator's architecture is based
on harvesting digital data from files, syndication feeds and Web
forms; transforming data for extracting metadata and creating new
outputs; storing the data; and indexing the data with Solr for
faceted searching

Faviki is a social bookmarking tool which allows you to tag
webpages you want to remember with Wikipedia terms. This means that
everybody uses the same names for tags from the world's largest
collection of knowledge

Fedora is open source software, a robust integrated
repository-centered platform that enables the storage, access and
management of virtually any kind of digital content, with an
emphasis on digital library assets, metadata and semantics\

FedX is a tool to setup on-demand federations by specifying a list
of relevant datasets (e.g. SPARQL endpoints from the LOD cloud) and
to query these in an efficient and transparent way as a virtually
combined dataset. Implemented as a Sesame SAIL, FedX can be used in
Java applications or via a command line interface

Fenfire is a Free Software project developing a computing
environment in which you can express the "everything is related to
everything else" relationships and benefit from them. Currently,
Fenfire is best applicable to browsing and editing RDF files

Fhat is RDF virtual machine (RVM) that is meant to be machine
independent. RVM software is written in typical human source code,
compiled to RDF triple-code, and processed by a computing machine
represented in RDF. Fhat is to be released in Fall 2007 and will be
programmed using Neno (see other entry).

Fingerpoint is a protocol, similar to Webfinger, allowing people to
associate data (such as their name, homepage link, etc) with their
e-mail address, and allowing third parties to discover that
information. It has the advantages of: 1) uses FOAF for
descriptions; 2) uses SPARQL for querying; 3) integrates with
linked data; and 4) is an easy service to set up. A draft spec is
also available at: http://buzzword.org.uk/2009/fingerpoint/spec

Fistful of LOD helps enrich an already existing web page content by
retrieving more information on the Open Data. It first fetches some
some text information using X-Path on an existing HTML page; then
inputs text in a SPARQL query upon any SPARQL endpoint; and then
injects HTML code based on the results in the source page

Flare promises to expose the power of Solr through a Rails-based
user interface. Integral to Flare will be a general purpose faceted
browsing, auto-suggest, folksonomy tagging/annotating, smart
folders, Atom support, and much more. Flare communicates with Solr
via the solrb Ruby-Solr [WWW] DSL (domain specific language)

FlexViz is a Flex-based, Protégé-like client-side ontology
creation, management and viewing tool. Very impressive and quite
comprehensive in functionality as an ontology development framework
and manager

The online Flickr2RDF service uses the Flickr API to extract
metadata from Flickr's photo repository, and generates an RDF
description. If flickr 'note' presents, it will be translated into
W3Photo Image Region Vocabulary, and be demonstrated via XSLT.

Flickcurl is a C library for the Flickr API, handling creating the
requests, signing, token management, calling the API, marshalling
request parameters and decoding responses. It uses libcurl to call
the REST web service and libxml2 to manipulate the XML responses.
It includes a program flickrdf to turn photo metadata, tags and
machine tags into RDF descriptions of photso and tags.

Flint SPARQL Editor is a within-browser SPARQL query editor and
assistant based on the CodeMirror framework. The tool provides
auto-completion sensitive to the syntactic context, SPARQL keywork
prompts and buttons enabled/disabled according to context, and
specific assists depending on the datasets available and loaded

FLORA-2 is an advanced object-oriented knowledge base language and
application development environment. The language of FLORA-2 is a
dialect of F-logic with numerous extensions, including
meta-programming in the style of HiLog and logical updates in the
style of Transaction Logic. FLORA-2 was designed with extensibility
and flexibility in mind, and it provides strong support for modular
software design through its unique feature of dynamic modules.

FLORID (F-LOgic Reasoning In Databases) is a deductive
object-oriented database system employing F-Logic as data
definition and query language. Florid has been extended for
handling semistructured data in the context of information
integration from the semantic Web.

FluentEditor for OWL is a comprehensive tool for editing and
manipulating complex ontologies using Controlled Natural Language.
FluentEditor provides an alternative to XML-based OWL editors, that
may be easier to use for some human users. It uses Controlled
English as its knowledge modeling language. It is supported by a
predictive editor that prohibits entering a sentence that is not
grammatically or morphologically correct. Windows only

F-OWL is an ontology inference engine for the Web Ontology Language
OWL. The ontology inference mechanism in F-OWL is implemented using
Flora-2, an object-oriented knowledge base language and application
development platform that translates a unified language of F-logic,
HiLog, and Transaction Logic into the XSB deductive engine

The Federated knOwledge eXtraction (FOX) framework integrates and
merges the results of frameworks for Named Entity Recognition,
Keyword/Keyphrase Extraction and Relation Extraction by using
machine learning techniques. By these means, FOX can generate RDF
out of natural language with improved accuracy

Freebase is an online data posting and exchange venue, organized
similarly to social networking sites only with a data emphasis; the
data is available under Creative Commons but the software is not
open source

Fresnel is a simple, browser-independent vocabulary for specifying
how RDF graphs are presented. work in SIMILE also includes
demonstration code for a Java-based, Fresnel-aware application. The
home for Fresnel work has been relocated to
http://www.w3.org/2005/04/fresnel-info/

The Fullish Test Suite is a suite of characteristic OWL 2 Full
reasoning test cases. The test cases are not meant to have
important use cases but, rather, their purpose is to demonstrate
different distinguishing semantic aspects of the OWL 2 RDF-Based
Semantics, which are typically not shown by either OWL 2 DL or by
the OWL 2 RL/RDF rules

Fusion is a tool for exploring and expliciting relationships in
Linked Data. It lists all undirected paths between two given
resources (URIs), with a specified maximum length; allows the user
to define new properties based on composing a generalization of
paths; allows the user to visualize the derived properties and its
instances; and provides a separate SPARQL endpoint with the
extended data.

Fuzz is designed to detect RDFa and display it to the person
browsing. Fuzz is a native Firefox plug-in that uses librdfa for
it's processing back-end. It is most useful for detecting embedded
semantic information in web pages and performing actions on that
semantic data.

GeoMaker creates microformats and maps from geographical
information embedded in texts. You can either provide a URL to load
and hit the "load content" button or start typing your own text and
hit the "get locations" button to continue.

geometry2rdf is a library for generating RDF files for geometrical
information (which could be available in GML or WKT). The GML and
WKT is manipulated with GeoTools. The current version of the
library works with Oracle geospatial databases and relies on Jena

gFacet is an Adobe Flex tool for browsing RDF data that is
accessible via arbitrary SPARQL endpoints (e.g. DBpedia). It
combines graph-based visualization with faceted filtering
techniques. The graph-based visualization of facets supports the
integration of different domains and an efficient exploration of
highly structured and interrelated datasets.

GLOW is a visualization for OWL ontologies, based on Hierarchical
Edge Bundles. Hierarchical Edge Bundles is a new visually
attractive technique for displaying adjacency relations in
hierarchical data, such as concept structures formed by
`subclass-of’ and `type-of’ relations. The displayed adjacency
relations can be selected from an ontology using a set of common
configurations, allowing for intuitive discovery of information. It
is a visualization library based on OWL API, as well as a plug-in
for Protégé

gnizr™ (g?-n?z?r) is an open source application for social
bookmarking and web mashup. It is easy to use gnizr to create a
personalized del.icio.us-like portal for a group of friends and
colleagues to store, classify, and share information, and
mash-it-up with information about location. It organizes its tags
via SKOS and uses Geonames.

The gnogno framework is an approach to bind RDF models from RDF2Go
to Swing and Eclipse SWT widgets. It allows to edit text values,
make lists, and program User Interfaces (semantic) using a clear
design pattern

GoNTogle is a semantic annotation and search tool, built on top of
Lucene and Protege. It supports manual and automatic annotation of
several types of documents (doc, pdf, rtf, txt, odt) using ontology
concepts

The Google Refine RDF extension adds a graphical user
interface(GUI) for exporting data of Google Refine projects in RDF
format. The export is based on mapping the data to a template graph
using the GUI.

Graphite is an open source PHP Library, built on top of ARC2, to
make it easy to do stuff with RDF data really quickly, without
having to naff around with databases. It is not intended to be
scalable, or a way of authoring RDF data.

Graphl is a generic graph visualization and manipulation tool
written in Java. Graphl reads and writes RDF files, visualizes them
in a flexible and customizeable way and allows users to edit them
intuitively

Greengrass is a C# framework providing a high-level API for the
Resource Description Framework (RDF), allowing RDF triples to be
parsed, stored, and manipulated. Greengrass supports serialization
and deserialization of CLR types to plain RDF, OWL, and other RDF
vocabularies. Greengrass should work with any CLR-complied
language, like C#, BOO, IronPython, Nemerle, IKVM Java, and VB.NET

Gremlin is a graph-based programming language. The documentation
herein will provide all the information necessary to understand how
to use Gremlin for graph query, analysis, and manipulation. It is
based on XPath.

Grit (Grokkable RDF Is Transformable) is an XML format intended to
solve the shortcomings of the often cumbersome and varied RDF/XML
format. Grit was originally designed for use in XSLT, but has
potential to be a generally versatile XML-format for RDF.

GroupMe! combines Semantic Web and Web 2.0 technologies to enable a
new kind of social tagging system that can create groups of
multimedia web resources via drag & drop, style groups (like
mind maps), tag resources and groups, search for other groups and
resources, and write applications using the GroupMe! API

Open source graphical ontology browser and editor. The GrOWL
browser can be used inside a web browser or as a stand-alone
application. When used inside a browser, it supports JavaScript
interaction so that it can be used as a concept chooser with
implementation-defined operations

Gruff is a triple-store browser that displays visual graphs of
subsets of a store's resources and their links. By selecting
particular resources and predicates, you can build a visual graph
that displays a variety of the relationships in a triple-store.
Gruff can also display tables of all properties of selected
resources or generate tables with SPARQL queries, and resources in
the tables can be added to the visual graph.

Harava is a metadata aggregator and validator tool. It collects
semantic information (RDF, RDFa, Dublin Core metadata etc) from web
sources, validates it according to a chosen schema, and produces a
clean RDF output file that can then be used and stored in a content
management system or other data repository

HAWK is a repository framework and toolkit that supports OWL. It
provides APIs as well as implementations for parsing, editing,
manipulating and preservation of OWL ontologies. It contains the
storage models of SimpleMemory (memory based storage without DL
inferences), DLMemory (memory based storage with DL inferences),
SimpleDB (database storage without DL inferences), and DLDB
(database storage with DL inferences). The system supports MS
Access, PostgreSQL and MySQL

Haystack is a research program focusing on information access,
analysis, management, and distribution. Its goal is to make it
easier for people to collect, organize, find, visualize, and share
their information. It sponsors a number of specific projects.

HbaseRDF uses Hbase + MapReduce to store RDF data and execute
queries (e.g., SPARQL) on them. Hbase is a Bigtable-like storage
system for the Apache Lucene Hadoop project. Google's Bigtable is a
distributed storage system for structured data, and is a very
effective mechanism for storing very large amounts of data in a
distributed environment.

Heart of Gold is a middleware for the integration of deep and
shallow natural language processing components. It provides a
uniform and flexible infrastructure for building applications that
use Robust Minimal Recursion Semantics (RMRS) and/or general XML
standoff annotation produced by NLP components

HermiT is a theorem prover for description logics (DLs). HermiT
implements a novel hypertableau reasoning algorithm. It uses the
OWL 2 API as the API for loading and managing ontologies. The
current version is a research prototype.

H-Maps is a commercial suite of tools for building topic maps
applications, consisting of a topic maps engine and server, a
mapping framework for converting from legacy data, and a navigator
for visualizing data. It is typically used in bioinformatics (drug
discovery and research, toxicological studies, etc), engineering
(support and expert systems), and for integration of hetereogeneous
data. It supports the XTM 1.0 and TMAPI 1.0 specifications

Hoolet is an implementation of an OWL-DL reasoner that uses a first
order prover. The ontology is translated to collection of axioms
(in an obvious way based on the OWL semantics) and this collection
of axioms is then given to a first order prover for consistency
checking. This prototype is for Linux only due to the the bundled
reasoner

HotPlanner is an automated planning system based on HTN and
integrated with Pellet that uses OWL and RDF to represent planning
domains and state knowledge base management, reasoning, and query
answering.

HtmlUnit is a GUI-less Web browser written in Java. It allows
high-level manipulation of Web sites from other Java code,
including filling and submitting forms and clicking hyperlinks. It
also provides access to the structure and the details within
received Web pages. HtmlUnit emulates parts of browser behavior
including the lower-level aspects of TCP/IP and HTTP. It can also
be used for Web scraping

Hunter Gatherer is a Firefox extension that makes it easy to
capture chunks of web pages on the go. Collected info is added to a
"collection"; you may create as many collections as you'd like. HG
also automatically captures the location/URL of the collection
page.

hurl makes HTTP requests and allows you to choose the request
method, customize headers and POST parameters, add basic
authorization, and even follow redirects. Results are shown as a
nicely formatted request and response. hurl is like an online curl.

Hyena (Hypergraph Editor and Navigator) is an RDF editor that
allows one to mix and match RDF vocabularies to flexibly model any
kind of data. It contains plugins for many vocabularies including
semantic Web, Fresnel, SPARQL, etc. It can be used as an Eclipse
plugin or as a standalone Ajax utility.

HyperGraphDB is a general purpose, extensible, portable,
distributed, embeddable, open-source data storage mechanism. It is
a graph database designed specifically for artificial intelligence
and semantic web projects, it can also be used as an embedded
object-oriented database for projects of all sizes

Hyperthing is a semantic Web URI validator that helps to publish
the Linked Data. Hyperthing determines if the requested URI
identifies a real world object or a Web document; checks whether
the published URI follows the W3C hash URIs and 303 URI practice;
and checks the validity of redirects (including 301, 302 and 307)

HyperTwitter allows users to consolidate or relate pairs of Twitter
hashtags, user IDs, or arbitrary URIs using a simple syntax. The
Hypertwitter application then can extract these to expand queries
or to create an RDF graph for use in other Web applications

IBM Semantics Toolkit is designed for storage, manipulation, query,
and inference of ontologies and corresponding instances. A major
purpose is to establish an end-to-end ontology engineering
environment tightly integrated with dominant Meta- Object Facility
(MOF)-based modeling and application development tools. The
semantics toolkit contains three main components (Orient, EODM, and
RStar), which are designed for users of different levels.

IF-Map is an Information Flow based ontology mapping method. It is
based on the theoretical grounds of logic of distributed systems
and provides an automated streamlined process for generating
mappings between ontologies of the same domain.

This is a state of the art NER tagger that tags plain text with
named entitites (people / organizations / locations /
miscellaneous). It uses gazetteers extracted from Wikipedia, word
class model derived from unlabeled text and expressive non-local
features

Instead of tags, ImageNotion are used for the annotation of images.
ImageNotion extends the possibilities of text based image
annotation. Each ImageNotion visually represents a person an object
or an entity, e.g. 'Nicole Kidman', 'Tank', 'Female' by so called
'imagenotions'. Each imagenotion is described with tags and
synonymous meanings in different languages and it is possible to
create relations between two ImageNotion, e.g. to state that
'Nicole Kidman' is 'Female'. This reduces the annotation time,
because the image annotations benefit from all stored information
of the ImageNotion used for annotation.

iMapping is a technique for visually structuring information
objects. It supports the full range from informal note taking over
semi-structured personal information management to formal knowledge
models. With iMaps, users can easily go from overview to
fine-grained structures while browsing editing or refining the
knowledge base in one comprehensive view

This toolkit extracts the infobox RDF data from the Wikipedia data
dumps in XML format, and the extracted data are stored in a
user-specified PostgreSQL database. It consists of an infobox data
extraction tool, a data cleansing tool, and the RDF conversion tool

INQLE (Intelligent network of Querying and Learning Engines) is an
open source server with Jena SDB back-end datastore. It runs
automated, random machine learning experiments on semantic data,
and stores any discovered correlations as RDF. It also provides
tools for loading spreadsheet data into the RDF database. It uses
RapidMiner

Via this endpoint you can write and test executable English rules
using a browser. Then you can extend this Java client endpoint stub
to make your rules available in your Service Oriented Architecure.
You can use your rules to automatically generate and run complex
SQL over networked databases, e.g. containing RDF triples, with
explanations in English, at the business or scientific level.

iQvoc is a tool for publishing and maintaining SKOS vocabularies.
iQvoc addresses the needs of typically distributed editorial teams
that want to cooperate in continuous enhancements in order to
provide a serious point of reference for the terminology of their
respective domain of discourse.

The Internet Reasoning Service - IRS - is KMi’s Semantic Web
Services framework, which allows applications to semantically
describe and execute Web services. The IRS supports the provision
of semantic reasoning services within the context of the Semantic
Web

iServe is a platform for publishing Semantic Web Services as linked
data, no matter their original format (try the iServe Browser).
iServe builds upon the Web of Data principles and uses as its core
what we refer to as the Minimal Service Model, a minimal vocabulary
for describing services in RDF, which abstracts away from the
original approach used for annotating the services, such as SAWSDL,
WSMO-Lite, MicroWSMO or OWL-S

JATE (Java Automatic Term Extraction) toolkit is a library with
implementation of several state-of-the-art term extraction
algorithms. It also provides a generic development and evaluation
framework for implementing new term extraction algorithms. JATE is
built on the previous JATR library. Currently the library has
implemented 6 state-of-the-art algorithms: 1) basic term frequency;
2) TF-IDF; 3) Weirdness; 4) C-value; 5) GlossEx; and 6) TermEx

Jena is a Java framework to construct Semantic Web Applications. It
provides a programmatic environment for RDF, RDFS and OWL, SPARQL
and includes a rule-based inference engine. It also has the ability
to be used as an RDF database via its Joseki layer. See the jena
discussion list for more information

The Jena GRDDL Reader is an implementation of GRDDL for the Jena
Semantic Web Framework, using the Saxon XSLT Processor. Jena is
open source and grown out of work with the HP Labs Semantic Web
Programme

jenabean uses Jena's flexible RDF/OWL api to persist Java beans. It
takes an unconventional approach to binding that is driven by the
Java object model rather than an OWL or RDF schema. jenabean is
annotation based and does not place any interface or extension
requirements on the Java object model. By default jenabean uses
typical Java bean conventions to derive RDF property URIs.

JeromeDL is an open source digital library that uses Semantic Web
technology to provide better access to its resources. The systems
provides a distributed catalogue, maintained by the users, and
resource annotation features that aim to overcome problems related
to security and privacy that are common in traditional
implementations of collaborative filtering systems

JessTab is a plug-in for Protégé that allows you to use Jess and
Protégé together. JessTab provides a Jess console window where you
can interact with Jess while running Protégé. Furthermore, JessTab
extends Jess with additional functions that allows you to map
Protégé knowledge bases to Jess facts. Also, there are functions
for manipulating Protégé knowledge bases from Jess.

JFresnel is a Java library that implements the Fresnel
specification for various RDF APIs, such as Jena and Sesame.
Fresnel is a presentation vocabulary for Semantic Web data designed
to be application and representation paradigm independent.

Jigs4OWL is a semantic mashup framework which offers a Web-enabled
semantic infrastructure while hiding all non-relevant technical
details. It allows to easily build web browser applications with
the help of customizable widgets which aggregate, visualize, and
mashup semantic content

jtrioo is the Java implementation of the 'trioo' project, which has
its objective to easily manage RDF data directly from
object-oriented programming languages, without negative impacts on
their object-oriented designs while still keeping the semantics of
the data accurate

JXML2OWL API is a library for mapping XML schemas to OWL Ontologies
on the Java platform. It creates an XSLT which transforms instances
of the XML schema into instances of the OWL ontology. JXML2OWL
Mapper is GUI application using the JXML2OWL API

KAON is an open-source ontology management infrastructure targeted
for business applications. It includes a comprehensive tool suite
allowing easy ontology creation and management and provides a
framework for building ontology-based applications. An important
focus of KAON is scalable and efficient reasoning with ontologies

KM is the knowledge representation language. KM is a powerful,
frame-based language with clear first-order logic semantics. It
contains sophisticated machinery for reasoning, including selection
by description, unification, classification, and reasoning about
actions using a situations mechanism. Its origins were the Theo
language and the (now obsolete) language KRL.

Knoodl is sort of an ontology editor, registry/repository, and wiki
all rolled into an easy to use online application. It uses a wiki
framework for the collaborative development of community ontology
vocabularies, which can then be documented or exported by use in
other ontology tools (OWL standard output format).

KnowWE is a Java-based semantic knowledge wiki based on the
implementation of JSPWiki. It's parsing engines and problem-solvers
also build on the d3web project. KnowWE stands for Knowledge Wiki
Environment and emphasizes the distributed development of
problem-solving knowledge within a semantic wiki.

Kraken is an application for managing knowledge objects, which can
be documents, remote or locally cached Web pages, personal
information, todo list items, appointments, and so on. It is
especially useful for researchers or students to manage their
information. Users can annotate these knowledge objects with
metadata, perform complex queries, and present the results as HTML
pages. Kraken uses RDF as its native format, allowing its data to
be easily read by external applications

Krextor (the KWARC RDF Extractor) is an extensible XSLT-based
framework for extracting RDF from XML, supporting multiple input
languages as well as multiple output RDF notations. Krextor
provides convenience templates that try to do “the right thing”™ in
many common cases, as to reduce the need for manually writing
repetitive code.

The Large Knowledge Collider (LarKC) is a platform for massive
distributed reasoning that aims to remove the scalability barriers
of currently existing reasoning systems for the Semantic Web. It is
based in part on Cyc as a reasoning engine and has a plug-in
architecture

LARQ is free-text indexing for SPARQL. LARQ is a combination of ARQ
and Lucene. It gives ARQ the ability to perform free text searches.
Lucene indexes are additional information for accessing the RDF
graph, not storage for the graph itself

LDIF is a software component for building Linked Data applications
which translates heterogeneous Linked Data from the Web into a
clean, local target representation while keeping track of data
provenance

LDPath is a simple path-based query language similar to XPath or
SPARQL property paths that is designed for querying and retrieving
resources from the Linked Data Cloud. It does so by following RDF
links between resources and servers. The LDPath project is a
collection of generic libraries that are independent of the
underlying RDF implementation

The LDSpider project aims to build a web crawling framework for the
linked data web. Requirements and challenges for crawling the
linked data web are different from regular web crawling, thus this
projects offer a web crawler adapted to traverse and harvest
sources and instances from the linked data web.

The Lemur Toolkit is a open-source toolkit designed to facilitate
research in language modeling and information retrieval. Lemur
supports a wide range of industrial and research language
applications such as ad-hoc retrieval, site-search, and text
mining.

LENA (LEns based NAvigator) is a Fresnel lens-based RDF Navigator
with SPARQL selector support. A "lens" represents a particular view
onto RDF data and is described by the Fresnel Display Vocabulary.
LENA enables viewing RDF data in a web browser, rendered according
to the specif lens descriptions provided. LENA supports the use of
multiple lenses and is now integrated into the Simile Fresnel
engine. See also http://code.google.com/p/lena/

LESS is an end-to-end approach for the syndication and use of
linked data based on the definition of visualization templates for
linked data resources and SPARQL query results. Such syndication
templates are edited, published and shared by using LESS'
collaborative Web platform. Templates for common types of entities
can then be combined with specific, linked data resources or SPARQL
query results and integrated into a wide range of applications,
such as personal homepages, blogs/wikis, mobile widgets etc.

Lexaurus Editor is for off-line creation and editing of
vocabularies, taxonomies and thesauri. It supports import and
export in Zthes and SKOS XML formats, and allows hierarchical /
poly-hierarchical structures to be loaded for editing, or even
multiple vocabularies to be loaded simultaneously, so that terms
from one taxonomy can be re-used in another, using drag and drop.

LexOWL is a plug-in for Protégé 4. In order to add more powerful
functionality (''e.g.'', inferencing, editing) to the existing
infrastructure and align LexGrid more closely with various Semantic
Web technologies, the LexOWL plugin for Protégé 4 provides a way
for representing the ontologies modeled within the LexGrid
environment in OWL. A source for downloading this tool has not been
found

librdfa is a pure C implementation of a standards-compliant RDFa
parser. The library is quite easy to use (there are only 5
functions). librdfa is stream-based (very little memory usage),
very small (the library is around 19KB) and fast.

LILY is a system matching heterogeneous ontologies. LILY extracts a
semantic subgraph for each entity, then it uses both linguistic and
structural information in semantic subgraphs to generate initial
alignments. The system is presently in a demo version only.

The LIMES (Link Discovery Framework for Metric Spaces) framework is
a time-efficient and lossless approaches for large-scale link
discovery based on the characteristics of metric spaces. It is
available as a standalone Java tool or via a configurable web
interface.

LingPipe is a suite of Java tools designed to perform linguistic
analysis on natural language data. LingPipe's flexibility and
included source make it appropriate for research use. Version 1.0
tools include a statistical named-entity detector, a heuristic
sentence boundary detector, and a heuristic within-document
coreference resolution engine

LinguaStream is an integrated experimentation environment (IEE)
targeted to researchers in Natural Language Processing.
LinguaStream allows processing streams to be assembled visually,
picking individual components in a "palette" (the standard set
contains about fifty components, and is easily extensible using a
Java API, a macro-component system, and templates). Some components
are specifically targeted to NLP, while others solve various issues
related to document engineering (especially to XML processing).
Other components are to be used in order to perform computations on
the annotations produced by the analysers, to visualise annotated
documents, to generate charts, etc.

Linkator is an application that automatically adds semantically
annotated links to existing or previously generated web pages.
Linkator uses information extraction to detect possible terms in
the text, and then uses semantic Web technologies to dereference
the links. It is limited to Firefox.

The Linked Data API is a vocabulary and processing model for a
configurable API layer intended to support the creation of simple
RESTful APIs over RDF triple stores. The API layer is intended to
be deployed as a proxy in front of a SPARQL endpoint to support: 1)
generation of documents (information resources) for the publishing
of Linked Data; 2) provision of sophisticated querying and data
extraction features, without the need for end-users to write SPARQL
queries; and 3) delivery of multiple output formats from these
APIs, including a simple serialisation of RDF in JSON syntax.

Linked Data Mapper is a browser-based semiautomatic SQL-RDF mapping
tool. It aims at providing a easy-of-use tool to help domain
experts define semantic mappings and also providing the
functionalities of converting relational data to RDF data
automatically or based on manually defined mappings

Linked Ratpack is a microframework, similar to Ruby's sinatra, and
based off of the Groovy microframework Ratpack. Linked Ratpack
allows you to quickly build linked data servers that can serve up
RDF at various URLs. The main features it provides are integrating
with Groovy SPARQL to provide an easy way to build RDF,
de-reference URIs from other linked data sources, and build RDF
from SPARQL Construct statements

LinkedMarkMail is a simple Linked Data interface for accessing the
MarkMail archives. The API offers a RDF version of the information
indexed by MarkMail; using SIOC, therefore fully compatible with
many other applications. There is an online version. The system is
provided by SWAML.

LingToRDF provides a full-featured LINQ query provider for .NET
using both local triple stores with Graph Matching and SPARQL
queries on remote stores. The project also includes the LingToRDF
Designer, which is a graphical designer to auto-generate C# entity
models as well as N3 ontology specifications from UML-like designs,
used as an extension to Visual Studio 2008 beta 2

Live Clipboard is a DHTML Web control that provides copy/paste
functionality for data associated with a web page using the Live
Clipboard XML data format. It has a UI display icon, and JavaScript
objects, serialization and callbacks.

Live OWL Documentation Environment (LODE) is a service that
automatically extracts classes, object properties, data properties,
named individuals, annotation properties, general axioms and
namespace declarations from an OWL and OWL2 ontology, and renders
them as ordered lists, together with their textual definitions, in
a human-readable HTML page designed for browsing and navigation by
means of embedded links.

Longwell is a web-based RDF-powered highly-configurable faceted
browser. It mixes the flexibility of the RDF data model with the
effectiveness of the faceted browsing UI paradigm and for
visualization and browsing. You can build a user-friendly web site
within minutes and without requiring any code at all.

Loom is a language and environment for constructing intelligent
applications. The heart of Loom is a knowledge representation
system that is used to provide deductive support for the
declarative portion of the Loom language. Declarative knowledge in
Loom consists of definitions, rules, facts, and default rules.

Loomp allows users to create semantically enhanced content and
compile content with drag and drop, to publish html, blog, wiki,
and feeds in various formats such as pdf, doc, odt, and to provide
a reader that highlights relevant information and allows navigation

LOOT is a Lightweight Object-Oriented Triplestore implemented in
Python. LOOT takes several ideas from RDF and OWL, but seeks its
own solutions. For example, LOOT has its own way of handling
ordered lists. It also treats literal values as first-class things,
so that the language and datatype of a literal can be handled just
like any other triple.

Apache Lucene is a high-performance, full-featured text search
engine library written entirely in Java. It is a technology
suitable for nearly any application that requires full-text search,
especially cross-platform. It is open source

A commercial search engine using semantic Web technologies; it can
use structured information, semantic markings and relations
knowledge of information resources for the search and presentation
of their contents

Machinese Syntax provides a full analysis of texts by showing how
words and concepts relate to each other in sentences - still with
very competitive speed and accuracy. Machinese Syntax helps
analytic applications understand text beyond the level of words,
phrases and entities: also their interrelations (such as events,
actions, states and circumstances); from Connexor

Magpie supports the interpretation of web documents through
on-the-fly ontologically based enrichment. Semantic services can be
invoked either by the user or be automatically triggered by
patterns of browsing activity

Mahout provides a wide range and many scalable machine learning
libraries. Its core algorithms for clustering, classfication and
batch based collaborative filtering are implemented on top of
Apache Hadoop using the map/reduce paradigm. Major applications
include recommendation mining, clustering, classifications and
frequent itemset mining.

Makna is a semantic wiki where each wiki instance can be configured
to use one or more ontologies that suits its topic. The engine
supports RDFS and OWL ontologies and the reasoner can be chosen as
well; it is still in active development

map4rdf is configured to use a SPARQL endpoint to provide map-based
visualization of RDF data. map4rdf uses Google Maps to model data
using either GEO (W3C) or GeoLinkedData. The viewer includes a
faceted browser interface and allows visualization of statistical
data using SCOVO. It can visualize polylines and polygons when
using GeoLinkedData

MapPSO is a tool for Ontology Alignment, which uses Discrete
Particle Swarm Optimisation. A particle swarm is used to search for
the optimal alignment. The algorithm is massively parallel and
adapts naturally on parallel architectures

Marbles is a server-side application that formats Semantic Web
content for XHTML clients using Fresnel lenses and formats. Data is
retrieved from multiple sources and integrated into a single graph
that is persisted across user sessions. Marbles follows specific
predicates found in retrieved data such as owl:sameAs and
rdfs:seeAlso in order to gain more information about a resource and
to obtain human-friendly resource labels.

Marmite is a Firefox add-on to let end-users create mashups by
making it easy to extract content from web pages, process it in a
data-flow manner, integrate it with other data sources, and direct
it to a variety of useful sinks, such as databases, map services,
and compilable source code that can be further customized; tool not
yet released.

The Mastro OBDA plugin extends Protege 4.0 with ontology-based data
integration capabilities. It adds a new tab for the definition of
different kinds of constraints and a reasoner that is able to work
in the OBDA context. The Mastro OBDA plugin is able to take the
data sources and mappings defined in an OBDA model and use them
during query answering, consistency checking and all reasoning
operations

The MatrixBrowser project presents a new approach for visualizing
and exploring large networked information structures which may
represent, for instance, linked information resources or metadata
structures such as ontologies

Meaningtool is an online Semantic Engine that categorizes content,
generates tag clouds and detects entities (such as people,
companies or places) for any given website. Works with all major
western languages. Automatically scraps the relevant text of a
site. And with its Category Manager, it's very simple to define a
custom set of categories and train them to meet specific
categorization needs.

MetaDesk is an RDF authoring tool that emphasizes entry of facts,
rather than construction of ontologies. MetaDesk places no
restrictions on vocabulary-users can invent terms on-the-fly, which
the system converts into underlying RDF structures.

MetaGlance is a automated metadata generation Web service. It is
free for light use. It gathers metadata from most types of files,
such as PDFs, Microsoft Word documents, Web pages, SCORM packages.
When metadata is not available or is incomplete, MetaGlance uses a
series of algorithms to generate metadata from the text. Results
are provided in JSON

METAmorphoses is a set of tools for flexible and easy-to-use
generation of RDF metadata directly from a relational database.
Metadata are genereated according to the mapping from an existing
database schema to a particular ontology.

MindRaider is Semantic Web outliner. It aims to connect the
tradition of outline editors with emerging technologies. MindRaider
mission is to organize not only the content of your hard drive but
also your cognitive base and social relationships in a way that
enables quick navigation, concise representation and inferencing

Mint is an application that complements ReDBox, and acts as a
clearinghouse for other data sources. It is used to mint Linked
Data identifiers (HTTP URIs which basically means URLs) for things
and concepts that don't have them yet

mle is a Semantic Web tool that allows the exploration of mailing
lists via query, timeline view, etc. A valid W3C public mailing
list URI is entered and used to retrieve the content of the given
mailing list, augment it with metadata using the SIOC vocabulary
and deploy it with XHTML + RDFa.

MnM is an annotation tool which provides both automated and
semi-automated support for annotating web pages with semantic
contents. MnM integrates a web browser with an ontology editor and
provides open APIs to link to ontology servers and for integrating
information extraction tool

MOAW (pronounce like mauve: |mõv|) can be seen as a URI suggestion
tool, building on the ``auto-completion'' feature made popular by
Web 2.0 websites and Google keyword suggestion. Basically, MOAW can
be attached to any text field (HTML input element) so that, while
typing, suggestions of URIs would appear that can be selected to
replace the corresponding word. The URIs suggested by MOAW are
discovered thanks to Watson.

Simple OWL tools, featuring UML (XMI), ErWin, thesaurus and
imports. The editor is tree-based and has a “navigator” tool for
traversing property and class-instance relationships. It can import
XMI (the interchange format for UML) and Thesaurus Descriptor
(BT-NT XML), and EXPRESS XML files. It can export to MS Word.

MoKi, the MOdelling wiKI, is a modeling tool that supports the
creation of integrated domain and process models through structured
wiki pages. MoKi enables heterogeneous teams of experts to actively
collaborate. It has import and export in OWL and BPMN and graphical
browsing / editing of models.

mopy is the Music Ontology Python library, designed to provide easy
to use python bindings for ontology terms for the creation and
manipulation of music ontology data. mopy can handle information
from several ontologies, including the Music Ontology, full FOAF
vocab, and the timeline and chord ontologies.

Moriarty is a simple PHP library for accessing the Talis Platform.
It follows the Platform API very closely and wraps up many common
tasks into convenient classes while remaining very lightweight. It
also provides some simple RDF classes that are based on the
excellent ARC2 class library.

Morla is an editor of RDF documents that allows you to manage more
RDF documents simultaneously, visualize graphs, and use templates
for quick writing. You can import RDFS documents and use their
content to write new RDF triples. Templates are also RDF documents,
and they make Morla easily personalizable and expandable. You can
also use Morla as an RDF navigator, browsing the RDF documents
present on the Internet exactly as you are used to doing with
normal browsers

Morph is a Web service for converting semantic web data from one
format to another. RDF/XML, Turtle and RDF/JSON can all be
converted into one another, converted to TriX, Exhibit/JSON, or RSS
1.0, or rendered as HTML. SPARQL Results XML, and Facets XML, can
be rendered as HTML, or converted to JSON

Morpho allows you to create and manage your data, and to share it
with others. It was created to provide an easy-to-use,
cross-platform application for accessing and manipulating metadata
and data (both locally and on the network). It has been used
principally by ecologists, but is applicalbe to any domain.

myOntology is to produce the theoretical foundations, and
deployable technology for the Wiki-based, collaborative and
community-driven development and maintenance of ontologies instance
data and mappings

Neno is an RDF progamming language designed specifically for the
Fhat RDF virtual machine (RVM) (see other entry). It is an RDF
model of object-oriented programming. RDF triple-code is analogous
to virtual machine byte-code, but instead of being represented as
bits, bytes, and words, it is represented as URIs and triples. Neno
is to be released in Fall 2007.

Neologism is a simple, lightweight, Web-based RDF Schema vocabulary
editor and publishing system built on the Drupal CMS. Its goal is
to reduce the time required to create, publish and modify
vocabularies for the semantic Web. The system is not yet open for
use, but there is a demo and screencast.

The NeOn Toolkit is an extensible Ontology Engineering Environment
based on Eclipse. It is part of the reference implementation of the
NeOn architecture. It contains plugins for ontology management and
visualization with core features in basic editing,
visualization/browsing, and import/export of F-Logic, (subsets of)
RDF(S) and OWL. A number of commercial plugins extend the toolkit

NKE (Navigational Knowledge Engineering) is a lightweight
methodology for low-cost knowledge engineering by many potential
contributors. Using NKE, data providers can publish flat data on
the Web without extensively engineering structure upfront. The
vision of NKE is to produce ontologies as a result of users
navigating through a system. There is also a Web demo
(http://hanne.aksw.org/) and some mockups
(http://aksw.org/Projects/NKE#Mockups).

nl is a library that exposes a declarative API that allows us to
build sentences and rules. These are used as input for a knowledge
base built on the CLIPS production system. CLIPS builds a Rete
network with the rules and sentences, which can then be queried for
the consecuences of those in a most efficient way. nl offers a
syntax that can accommodate any coherent theory that we may build
with the natural language (in the same sense as something like the
semantic web's OWL-Full would), while at the same time being based
on a simple finite domain first order theory

NLTK — the Natural Language Toolkit — is a suite of open source
Python modules, data and documentation for research and development
in natural language processing. NLTK contains Code supporting
dozens of NLP tasks, along with 30 popular Corpora and extensive
Documentation including a 360-page online Book. Distributions for
Windows, Mac OSX and Linux are available

NOR2O is a library for transforming non-ontological resources to
ontologies. The library performs an ETL process. The NOR Connector
loads classification schemes, thesauri, and lexicons modelled with
their corresponding data models, and implementations. The
Transformer performs the transformations by implementing the
sequence of activities included in the patterns. This module
interacts with the Semantic Relation Disambiguator module for
obtaining the suggested semantic relations of the NOR elements. The
OR Connector generates the ontology using the OWL API.

OBDA (Ontology Based Data Access) Plugin is an add-on for the
Protege ontology editor aimed at transforming Protege into a fully
fledged OBDA (Ontology Based Data Access) model editor. It provides
data source and mapping editors, as well as querying facilities
that in conjunction with an OBDA-enabled reasoner, allow you to
design and test every aspect of an OBDA system

OBO-Edit is an open source ontology editor written in Java.
OBO-Edit is optimized for the OBO biological ontology file format.
It features an easy to use editing interface, a simple but fast
reasoner, and powerful search capabilities.

Ocelot is a Firefox extension, browser based ontology viewer. There
is also meant to be a servlet version of the viewer. Ocelot
displays a content summary of the document, as well as an
object-based viewer for graph navigation. See
http://ocelot.semwebcentral.org/ for an online viewer

The “Online Cultural Heritage Research Environment” (OCHRE) is an
Internet database system for research on cultural heritage in all
its forms. It is intended for researchers and students who are
engaged in archaeological, anthropological, linguistic, and textual
investigations of many different kinds.

OCS (Ontology Creation System) is a knowledge acquisition system,
designed to provide common platform for accessing ontologies. OCS
is working as a JEE application. Ontologies are stored in database
using RDF triples. Everything is accessible through a special
designed API, which keeps ontologies in object model. It is based
on OWL API project.

OLA (OWL-Lite Alignment) matches ontologies written in OWL. It
relies on a similarity combining all the knowledge used in entity
descriptions. It also deal with one-to-many relationships and
circularity in entity descriptions through a fixpoint algorithm.

OWLTools (aka OWL2LS - OWL2 Life Sciences) is a convenience Java
API on top of the OWL API. It provides many features, including: 1)
convenience methods for OBO-like properties such as synonyms,
textual definitions, obsoletion, replaced_by; 2) simple graph-like
operations over ontologies; and 3) visualization using the QuickGO
graphs libraries

The OMCSNet-WordNet project aims to improve the quality of the
OMCSNet dataset by using automated processes to map WordNet synonym
sets to OMCSNet concepts and import additional semantic linkage
data from WordNet. It is based on OMCSNet 1.2, a semantic network
and inference toolkit written in Python/Java. OMCSNet currently
contains over 280,000 separate pieces of common sense information
extracted from the raw OMCS dataset. This project is also based on
WordNet, an online lexical reference system that in recent years
has become a popular tool for AI researchers

omnidator takes input documents that uses Schema.org terms and
turns it into others kind of data formats. Currently, inputs of
microdata and CSV, with outputs of JSON and RDF serialisations
(RDF/XML and RDF/Turtle) are supported

The Omnigator is a form-based manipulaton tool centered on Topic
Maps, though it enables the loading and navigation of any
conforming topic map in XTM, HyTM, LTM or RDF formats. There is a
free evaluation version.

The One Click Annotator (OCA) is a tool to help create semantic
content by non-expert users. The OCA is an editor for Web browsers
for annotating words and phrases with references to ontology
concepts. Our main design goal is to simplify the annotation
process and provide a tool that non-expert users can easily use to
create semantic content

OntoBroker is middleware for the construction of semantic
applications. Its inference engine processes ontologies and the
logic represented inside them. OntoBroker is the core component in
most of Ontroprise's offerings. OntoBroker provides a Web Service
interface and can also be deployed in application servers.

OntoCAT or Ontology Common API Tasks is a Java library aiming to
overcome the hurdle of querying multiple existing ontology
resources. It supports both major ontology browsers: NCBO BioPortal
and EBI Ontology Lookup Service (OLS), providing a convenient
wrapper around their Web Services. It also supports querying of
local OWL and OBO resources making use of the OWL API

OntoComP is a Protégé 4 plugin for completing OWL ontologies. It
enables the user to check whether an OWL ontology contains "all
relevant information" about the application domain, and extend the
ontology appropriately if this is not the case

Ontogator is a generic view-based RDF search engine. It defines and
implements an RDF-based query interface that is used to separate
view-based search logic from the user interface. The interface is
defined as an OWL ontology

OntoGen is a semi-automatic and data-driven ontology editor
focusing on editing of topic ontologies (a set of topics connected
with different types of relations). The system combines text-mining
techniques with an efficient user interface. It requires .Net.

Ontolingua provides a distributed collaborative environment to
browse, create, edit, modify, and use ontologies. The server
supports over 150 active users, some of whom have provided us with
descriptions of their projects. Provided as an online service;
software availability not known

OntoMedia manages large multimedia collections using semantic
integration techniques for metadata using ontologies. The user can
organize multimedia collections with a graphical user interface
which includes easy metadata indexing and search capabilities. It
includes universal playback for many multimedia formats, including
MP3, DivX, and Quicktime

ontopia is a relative complete suite of tools for building,
maintaining, and deploying Topic Maps-based applications; open
source, and written in Java. Could not find online demos, but there
are screenshots (http://code.google.com/p/ontopia/wiki/Screenshots)
and there is visualization of topic relationships

Ontos API is a public web service which returns rich semantic
metadata for input text content you submit. Ontos recognizes
entities and relations between them using natural language
processing techniques. Although basic types of entities (people,
companies, places etc.) are pre-defined, the user can also create
OWL-driven dictionaries for custom types of entities, merge
entities across documents, etc. Presently in beta

OntoSearch is an ontological search engine designed to help users
find RDF based ontological information on the Semantic Web. It uses
the Google API to search the available ontologies in the RDF(S),
OWL and DAML (+OIL) representational formalisms

OntosMiner uses ontologies to define the areas of interest and
linguistic rules of NLP to analyze natural language text. Objects
and their relations are automatically realized and annotations to
the related text fragments are added. The outcome is a set of
searchable and conceptually structured data, which can be
categorized, browsed and visually presented in semantic networks.

OntoStudio is a professional developing environment for
ontology-based solutions. It combines modelling tools for
ontologies and rules with components for the integration of
heterogeneous data sources. As ontology-languages OntoStudio
supports W3C-standards OWL and RDF(S) and F-Logic for the
logic-based processing of rules. OntoStudio comes along with many
connectors to databases, documents, file-systems, applications and
web-services. 3-mo, royalty-free test available

OntoTrack is a browsing and editing ontology authoring tool for OWL
Lite. It combines a sophisticated graphical layout with mouse
enabled editing features optimized for efficient navigation and
manipulation of large ontologies

OntoViz is an attempt to enable the user to visually browse and
navigate large and complex ontologies containing millions of terms
and various relationships between them. It supports ontologies via
a Web service, stored in databases, or stored in files

Oort (Out of RDF Transmogrifier) is a Python-based toolkit for
accessing RDF graphs as plain objects. The purpose of Oort is to
make it easy to create web views of RDF Graphs by using some
declarative python programming. OortPub uses RDFLib, Paste and
Genshi as its core components. See also
http://code.google.com/p/oort/

The Open Anzo project is an open source enterprise-featured RDF
store and middleware platform that provides support for multiple
users, distributed clients, offline work, real-time notification,
named-graph modularization, versioning, access controls, and
transactions with preconditions. The project includes Glitter, a
SPARQL engine independent of any particular back-end.

OpenAmplify is a Web service that uses patented computational
linguistics technology to analyze text and surface its meaning. Its
supports SOAP, REST and other standards with output in XML or JSON.
Up to 1000 transactions per day are free.

The OpenCyc KB Browser is the main interface tool for accessing the
Cyc Knowledge Base (Cyc KB). It provides a means for browsing the
KB in a number of different ways, a means for querying the KB, and
(for registered users) a means for modifying or adding to the KB
itself.

IRIS is a semantic desktop application framework that enables users
to create a “personal map” across their office-related information
objects. IRIS includes a machine-learning platform to help automate
this process

The OpenLink Data Explorer, or ODE, (formerly known as the OpenLink
RDF Browser) is an RDF data browser provided as a Firefox plug-in.
It is implemented using the OpenLink AJAX and AJAR Toolkit (OAT).
ODE provides a Web browser driven interface for interacting with
RDF-based linked data via a combination of (X)HTML, AJAR, and
hyperdata links.

ODS is a distributed collaborative application platform for
creating Semantic Web applications such as: blogs, wikis, feed
aggregators, etc., with built-in SPARQL support and incorporation
of shared ontologies such as SIOC, FOAF, and Atom OWL. ODS is an
application of OpenLink Virtuoso and is available in Open Source
and Commercial Editions.

OpenQabal is an open-source social-networking and collaboration
platform / suite driven by the principles of federation,
composition, and openness; with a special emphasis on enabling
"distributed conversations" and the "federated social graph." It
leverages emerging Semantic Web technologies to build a richer
model for linking blogs, to allow conversations to span multiple
domains and providers; it has impressive support for virtually all
Web protocols and standards

OpenRecord is similar to a wiki, but with some database features
added in. Wiki systems usually let you organize your content as
pages of text, or pages of rich text. In OpenRecord your content is
organized as a database of items, or records. Each page on an
OpenRecord site can query the database to get some set of items,
and those items can be displayed in an editable table, or in an
outline format, or in other formats.

OPOSSum is a database of semantic Web service descriptions. The
majority of services currently listed result either from the
integration of existing SWS collections (OWLS-TC, SWS-TC,
SAWSDL-TC) or an effort for building a service collecting based on
real services from the geocoding domain.

Opossum is a search engine for Web services (semantic + WSDL) that
lets you find single and composed services using simple keyword
search. Its novelty is in its simple interface, its indexing
techniques and its semantic approximation methods

OPPL2 (ontology pre-processing language) is an abstract formalism
that allows for manipulating ontologies written in OWL. It is 100%
based on the Manchester OWL Syntax; a query language based on OWL
(logical) axioms and variables; a scripting language that allows
the addition/removal of OWL (logical) axioms. It is available as an
Protégé 4.1 plug-in

ORE stands for ontology repair and enrichment. It is a tool for
knowledge engineers to improve an OWL ontology through a wizard
like repair process. It uses state-of-the-art methods for fixing
inconsistencies and suggesting additions to an ontology, while
still being efficient for small and medium sized ontologies.

Oroboro is a lightweight Java RDF processing framework developed
with simplicity of use in mind and a minimalist attitude. Its
design focuses on conversion and data integration tasks involving
moderate datasets, targeting common use cases while trying to
remain as flexible as possible.

A Java interface and implementation for the W3C Web Ontology
Language (OWL), used to represent Semantic Web ontologies. The API
is focused towards OWL Lite and OWL DL and offers an interface to
inference engines and validation functionality

OWL Validator - A tool to check OWL markup for problems beyond
simple syntax errors. It examines OWL content for a variety of
potential errors and reports them along with the location of the
errors in the files

OWL2Prefuse is a Java package which creats Prefuse graphs and trees
from OWL files (and Jena OntModels). It takes care of converting
the OWL data structure to the Prefuse datastructure. This makes it
is easy for developers, to use the Prefuse graphs and trees into
their Semantic Web applications.

OWLdiff is a utility to compare and merge two ontologies, similar
to diff for textual files. OWLdiff uses the Pellet reasoner to
check if the two ontologies are semantically equivalent. If not, it
shows the differences graphically in two trees; the user can then
select differing items in either tree for updating in a merge. Is
based off of the OWL2 API (v 3.2.2)

OWLGrEd is a UML style graphical editor for OWL which not only
visualizes ontologies using extended UML class diagram notation but
also provides ontology editing facilities unavailable in most of
the other tools. Moreover, the editor contains additional features
for graphical ontology exploration and development including
interoperability with Protégé

OWLJessKB is a description logic reasoner for OWL. The semantics of
the language is implemented using Jess, the Java Expert System
Shell. Currently most of the common features of OWL lite, plus some
and minus some

The OWLlink API is a Java interface and implementation of the
OWLlink protocol on top of the Java-based OWL API] The OWLlink API
enables OWL API-based applications to access remote reasoners
(so-called OWLlink servers), and it turns any OWL API aware
reasoner into an OWLlink server

The hybrid semantic Web service matchmaker OWLS-MX 1.0 utilizes
both description logic reasoning, and token based IR similarity
measures. It applies different filters to retrieve OWL-S services
that are most relevant to a given query

paggr is a microcontent remixer that will allow you to aggregate,
monitor, enrich, integrate, and re-purpose structured web data.
paggr combines innovative Semantic Web technology with pragmatic
efforts such as microformats, RSS, and Live Clipboard.

Parliament is an open source knowledge base that implements a
high-performance storage engine, compatible with the RDF and OWL
standards. Asio Parliament is typically paired with a query
processor, such as Sesame or Jena, to implement a complete data
management solution that complies with the RDF, OWL, and SPARQL
standards for data representation, ontology, and query,
respectively.

Parrot is an RIF and RDF Ontologies documentation tool. It provides
users (both business users and developers) with useful reference
documentation about rulesets and ontologies expressed in standard
languages, such as RDFS, OWL or RIF

PCPACK5 is an integrated suite of 10 knowledge tools designed to
support the acquisition and use of knowledge. It has the
flexibility to support various methodologies such as CommonKADS,
MOKA, 47-Steps and SPEDE. PCPACK5 is a network-enabled application
that allows knowledge bases to be stored and accessed by multiple
users over a network. Each user has defined access rights that
allow or restrict their ability to edit and view certain knowledge
bases.

Pedro is an application that creates data entry forms based on a
data model written in a particular style of XML Schema. Users can
enter data through the forms to create data files that conform to
the schema. They can use controlled vocabularies to mark-up text
fields and have the application perform basic validation on field
data

Picky is a semantic text search engine for categorized data, such
as varchar fields from a database. The standalone server offers a
HTTP interface returning clean JSON, and a Web front end. There's
also a statistics web app and a live interface for changing
parameters inside the running server

Piggy Bank is a Firefox extension that turns your browser into a
mashup platform, by allowing you to extract data from different web
sites and mix them together. Piggy Bank also allows you store this
extracted information locally for you to search later and to
exchange at need the collected information with others

PinQA is a location based Q/A service in Japanese. Questions
related to geo positions are answered with a map view. In additon
to HTML based pages, it also returns RDF representations of these
QAs, based on content negotiation or explicit extension. The RDF
data contains the question description, geo: coordinates, nearby
stations from DBpedia, and links to related QAs.

Using the Yahoo! Pipes editor, you can fetch any data source via
its RSS, Atom or other XML feed, extract the data you want, combine
it with data from another source, apply various built-in filters
(sort, unique, count, truncate, union, join, as well as
user-defined filters), and apply simple programming tools like for
loops. In short, it’s a good start on the Unix shell for mashups

Platypus Wiki is an enhanced Wiki Wiki Web with ideas taken from
Semantic Web. It offers a simple user interface to create a Wiki
Page plus metadata according with W3C standards. It uses RDF/RDFS
and OWL to create ontologies and manage metadata

Polaris, is a lightweight Java library aimed to be used with such
SPARQL protocol compliant endpoint in a easy way. Polaris is
intended to be used with a generic triple store as a JDBC driver is
used with a relational DB.

PoolParty is a product family that supports enterprise information
management and web mining. Metadata management is based on SKOS.
The product family consists of three parts: 1) PoolParty Thesaurus
Manager
(http://poolparty.biz/products/poolparty-thesaurus-manager/); 2)
PoolParty Extractor
(http://poolparty.biz/products/poolparty-extractor/); and 3)
PoolParty Semantic Search
(http://poolparty.biz/products/poolparty-semantic-search/). In
addition, there are add-ons for using Pool Party with WordPress and
SKOSsy (http://poolparty.biz/try-it/skossy/), which generates
DBpedia-derived thesauri in German or in English for individual
domains

Protege+OWL+Ruby (POR) Utilities provides an ontology, a set of
ruby classes and methods to simplify the development of Protege+OWL
Ontology Driven applications. At the moment project is limited to
JRuby

Posh, the Prolog OWL Shell is an interactive toplevel
readeval-print-loop interface that provides powerful capabilities
for querying and transforming ontologies. It includes a bridge to
the OWLAPI and to multiple OWL reasoners, and allows a mixture of
closed-world rule-based querying on top of open world reasoning. It
also provides an interface to POPL, the Prolog Ontology Processing
Language

Potluck is a Web-based user interface that lets casual users—those
without programming skills and data modeling expertise—mash up data
themselves. Potluck is novel in its use of drag and drop for
merging fields, its integration and extension of the faceted
browsing paradigm for focusing on subsets of data to align, and its
application of simultaneous editing for cleaning up data
syntactically. Potluck also lets the user construct rich
visualizations of data in-place as the user aligns and cleans up
the data.

Prefuse is a set of software tools for creating rich interactive
data visualizations. The original prefuse toolkit provides a
visualization framework for the Java programming language. Prefuse
supports a rich set of features for data modeling, visualization,
and interaction. It provides optimized data structures for tables,
graphs, and trees, a host of layout and visual encoding techniques,
and support for animation, dynamic queries, integrated search, and
database connectivity.

PRIOR+ is a generic and automatic ontology mapping tool, based on
propagation theory, information retrieval technique and artificial
intelligence model. The approach utilizes both linguistic and
structural information of ontologies, and measures the profile
similarity and structure similarity of different elements of
ontologies in a vector space model (VSM).

Profium's Metadata Server is a content management system using
standardized metadata which improves information reusability and
allows the user to process and distribute further information
acquired from numerous sources in different formats

The PROMPT tab plug-in for Protégé is for managing multiple
ontologies by comparing versions of the same ontology, moving
frames between included and including project, merging two
ontologies into one, or extracting a part of an ontology.

Protégé is a free, open source visual ontology editor and
knowledge-base framework. The Protégé platform supports two main
ways of modeling ontologies via the Protégé-Frames and Protégé-OWL
editors. Protégé ontologies can be exported into a variety of
formats including RDF(S), OWL, and XML Schema. There are a large
number of third-party plugins that extends the platform's
functionality.

PROXIMITY is an open-source system for relational knowledge
discovery designed and implemented by the Knowledge Discovery
Laboratory in the Department of Computer Science at the University
of Massachusetts Amherst

Pubby is a simple Java-based web application that puts a Linked
Data compatible interface in front of a SPARQL endpoint. Pubby
handles requests to the mapped URIs by connecting to the SPARQL
endpoint, asking it for information about the original URI, and
passing back the results to the client. It also handles various
details of the HTTP interaction, such as the 303 redirect required
by Web Architecture, and content negotiation between HTML, RDF/XML
and N3 descriptions of the same resource

Pushback aims at turning the current 'read-only' Semantic Web into
a read/write Semantic Web. In pushback, a vocabulary is defined and
a process is created to create to produce RDForms (HTML form + RDF
data), which are then used to write back changes to the Web 2.0
sources via an RDF wrapper.

Pychinko is a Python implementation of the Rete algorithm, an
efficient way to apply forward-chaining rules to a set of facts.
Pychinko employs an optimized implemention of the algorithm to
handle facts expressed as triples, and process them using a set of
N3 rules

pyroo is the Python implementation of the 'trioo' project, which
has its objective to easily manage RDF data directly from
object-oriented programming languages, without negative impacts on
their object-oriented designs while still keeping the semantics of
the data accurate

pyrple parses RDF/XML, N3, and N-Triples. It has in-memory storage
with API-level querying, experimental marshalling, many utilities,
and is small and minimally interdependent. It can do graph
isomorphism testing, rule application, etc.

QEDWiki is a lightweight mash-up maker written in PHP 5 and hosted
on a LAMP, WAMP, or MAMP stack. A mash-up assembler will use
QEDWiki to create a personalized, ad hoc Web application or mash-up
by assembling a collection of widgets on a page, wiring them
together to define the behavior of the mash-up application, and
then possibly sharing the mash-up with others

Queso is the Web application framework component of the IBM
Semantic Layered Research Platform. It uses the Atom Publishing
Protocol to allow a browser-based Web application to read and write
RDF data from a server

Quest is a reasoner based on OBDA (Ontology Based Data Access). The
core of the system implements techniques for OBDA optimization.
Quest's reasoning engine is based on optimized DL-Lite
query-rewriting. Quest can be queried by SPARQL or SQL, and can tie
into any JDBC data source. It also has an interface to the OWL API

QuiKey is a light-weight tool that can act as an interactive
command-line for a semantic knowledge base. It focuses on highest
interaction-efficiency to browse, query and author graph-based
knowledge bases in a step-by-step manner. It combines ideas of
simple interaction techniques like auto-completion, command
interpreters and faceted browsing and integrates them to a new
interaction concept. QuiKey is currently bundled with the iMapping
application

The R2R Framework enables Linked Data applications to search the
Web for mappings and apply the discovered mappings to translate Web
data. The Framework provides the R2R Mapping Language; defines best
practices; and provides an open-source implementation of the R2R
Mapping Engine

RADAR is a comprehensive set of components for semi-automatic
cognitive learning. Machine learning has been developed to the
point where it can perform some truly useful tasks. However, much
of the learning technology that's currently available requires
extensive 'tuning' in order to work for any particular user, in the
context of any particular task. RADAR's components are designed to
overcome these limitations

RADiFy is a bookmarklet for the purposes of annotating a web page
with RDFa. RDFa is the Resource Description Framework in
attributes. This means that we can describe data on a web page in a
machine-readable format from the source.

The RDF server of the PHP RAP environment. The RAP NetAPI provides
an subset implementation of the W3C member submission RDF NetAPI
and of a subset of the SPARQL Protocol. The NetAPI allows you to
run an RDF server, with similar functionality as the Joseki RDF
server, on web servers that provide PHP support

RapidMiner (formerly YALE) and its plugins provide more than 400
operators for all aspects of data mining useful for business
intelligence or maching learning. Key plugins in text, CRF, and
others exist. A huge amount of visualization techniques and the
possibility to place breakpoints after each operator give insight
into the success of your design - even online for running
experiments

RAPPTOR is a semantic technologies-enabled Web application server
for the .Net platform. It integrates relational, hierarchical, and
even semantic data structures into a common API and offers broad
DBMS connectivity through pluggable drivers. It is designed for
rapid application development, a code-driven development style, and
platform-independence (Mono and MS .Net 1.0/1.1/2.0/3.0)

The Raptor RDF parser toolkit is a free software / Open Source C
library that provides a set of parsers and serializers that
generate Resource Description Framework (RDF) triples by parsing
syntaxes or serialize the triples into a syntax. The supported
parsing syntaxes are RDF/XML, N-Triples, Turtle, RSS tag soup
including Atom 1.0 and 0.3, GRDDL for XHTML and XML. The
serializing syntaxes are RDF/XML (regular, and abbreviated),
N-Triples, RSS 1.0, Atom 1.0 and Adobe XM

Rasqal is a C library for querying RDF, supporting the RDQL and
SPARQL languages. It provides APIs for creating a query and parsing
query syntax. It features pluggable triple-store source and
matching interfaces, an engine for executing the queries and an API
for manipulating results as bindings. It uses the Raptor RDF parser
to return triples from RDF content and can alternatively work with
the Redland RDF library's persistent triple stores. It is portable
across many POSIX systems

The RDF CCK module for Drupal allows site administrators to map
each content type, node title, node body and CCK field to an RDF
term (class or property). By default, RDF CCK will create local
classes and properties for all your content types and fields which
will be exported at node/*/rdf

Being able to reuse RDF vocabularies across sites is one of the key
elements for the semantic web to take off. The RDF external
vocabulary importer module (evoc) for Drupal caches any external
RDF vocabulary and provides properties to be mapped to CCK fields,
node title and body. This module requires the RDF and the SPARQL
modules

This program acts as a filter layer between SAX (The Simple API for
XML) and the higher-level RDF (Resource Description Format), an
XML-based object-serialization and metadata format. The RDF filter
library is used by several RDF-based projects

RDF Gravity is a tool for visualising RDF/OWL Graphs/ ontologies
including graph visualization, global and local filters, full-text
search, generating views from RDQL queries, and visualising
multiple RDF files. It is implemented by using the JUNG Graph API
and Jena semantic web toolkit

The RDF Metadata Module for Drupal allows you to apply RDF
statements to content nodes, specifying any number of relationships
between different documents or URLs. Relationships are modelled on
the Dublin Core Metadata definitions, and can support arbitrary
ontologies

The RDF extension for Google Refine (RDF-Refine) adds a graphical
user interface(GUI) for exporting data of Google Refine projects in
RDF format. The export is based on mapping the data to a template
graph using the GUI.

The RDF SPARQL Endpoint module for Drupal indexes the RDF data
publicly available on a Drupal site into an ARC2 RDF store. It also
provides a SPARQL endpoint via the ARC2 SPARQL interface. The
SPARQL module should be used instead if another database support is
required.

The RDF SPARQL Proxy module for Drupal allows to instantiate RDF
resources on demand (lazy loading) via the remote data schema to
the local Drupal RDF schema defined by the RDF API and RDF CCK .
When visiting a node, it is synchronized with the RDF endpoint
where the data originates from.

RDF::Notation3 is an RDF/N3 parser; it can parse N3 files or
strings and provide results in whatever format (as far as there are
subclasses supporting your desired format available). It is a base
class providing the parsing logic only, and more likely serves as a
parser for other Perl RDF modules (such as RDF::Core) focused on
how to store and access RDF models

RDF123 is an application and web service for converting data in
simple spreadsheets to an RDF graph. Users control how the
spreadsheet's data is converted to RDF by constructing a graphical
RDF123 template that specifies how each row in the spreadsheet is
converted as well as metadata for the spreadsheet and its RDF
translation.

This tool converts RDF/XML or N3 content into dataRSS feeds as
accepted by Yahoo SearchMonkey. The tool can be used to feed
GoodRelations-based e-commerce descriptions into the Yahoo family
of technology

RDFa² is an on-line service which does multi-topic annotation with
RDFa. With RDFa² the generated Web page can be revised using the
underlying XHTML+RDFa source. Alternatively, you can revise the
intermediate template and then save it to your local machine for
future reuse.

RDFaCE is an online RDFa content editor based on TinyMCE. It
supports different views for semantic content authoring and uses
existing semantic Web APIs to facilitate annotating and editing of
RDFa contents. RDFaCE combines the results of multiple NLP APIs,
currently including Alchemy,Extractiv, Open Calais, Ontos, Evri and
DBpedia Spotlight

This Java server creates and manages knowledge exchange groups.
Servers can go "public" (and therefore appear automatically in the
DBin list of available servers) or be used for private purposes. To
connect to one such private server from DBin, use the "add manual
server" feature.

RDFParser is fully compliant with the RDF/XML specification. The
parser passes all of the positive parser test cases from the W3C.
It has only been tested in Firefox 1.5 but should work in any
browser that supports the DOM Level 2 specification

RDFStats generates statistics for datasets behind SPARQL-endpoints.
The generator runs as a stand-alone process (e.g. beside a native
RDF Store, D2R-Server instance or any other SPARQL end-point). It
can also be embedded into other applications. It is based on Jena

RDig provides an HTTP crawler and content extraction utilities to
help building a site search for web sites or intranets. Internally,
Ferret is used for the full text indexing. After creating a config
file for your site, the index can be built with a single call to
rdig; it is written in Ruby

ReDBox is Web-based metadata registry application, similar to an
institutional repository, for describing research data. It follows
linked data principles and is built on the Fedora Commons
repository. It also uses the high performance Apache Solr text
indexer to drive the portal interface rather than an RDF triple
store

The ReDeFer project employs a “XML Semantic Reuse Methodology” that
combines an XML Schema to OWL web ontology mapping, called XSD2OWL,
with a mapping from XML to RDF, XML2RDF. The ontologies generated
by XSD2OWL are used during the XML to RDF step in order to generate
RDF data enriched by the XML Schema semantics made explicit.

Redfoot is a hypercoding system which is being used to create a
webized operating system and is also being used to create
applications. It is built around the notion of an RDF Graph for
persistence rather than a File Tree

The Redland RDF Application Framework is a set of free software
libraries that provide support for RDF. It provides parser for
RDF/XML, Turtle, N-triples, Atom, RSS; has a SPARQL and GRDDL
implementation, and has language interfaces to C#, Python, Obj-C,
Perl, PHP, Ruby, Java and Tcl

Refinder is an online collaboration tool for teams to collect and
share information. Teams use it to collect information from
different desktop and online applications in one place, filter the
data, and find relevant information.

Rej is a graphical utility that allows you to write RDF statements
directly, using a simple layout that is designed to naturally
reflect the basic RDF model. You might use Rej in any situation
that uses RDF as part of its pipeline, or as a general purpose
annotation tool, such as when taking notes or doing research

RELExO (Relational Exploration for Learning Expressive Ontologies)
is a tool for reasoner-aided relational exploration of OWL DL
ontologies, that has been developed to support the acquisition and
refinement of complex class descriptions

RelFinder extracts and visualizes relationships between given
objects in datasets and makes these relationships interactively
explorable. Highlighting and filtering features support analysis
both on a global and detailed level.

REMBRANDT is a named entity recognition tool that identifies and
classifies all named entities (NE) in the text (that is, entity
names such as proper names, places or organizations), and detects
the relations among NEs. REMBRANDT is prepared to classify entities
that have potentially different meanings, and disambiguates its
meaning whenever possible.

ReTAX is an aide to help a taxonomist create a consistent taxonomy
and in particular provides suggestions as to where a new entity
could be placed in the taxonomy whilst retaining the integrity of
the revised taxonomy (c.f., problems in ontology modelling).

ReVerb is a program that automatically identifies and extracts
binary relationships from English sentences. ReVerb is designed for
Web-scale information extraction, where the target relations cannot
be specified in advance and speed is important

Rex is a component of the Revelytix Emergent Analytics Software
Suite. It will execute any rules written using a Rules Interchange
Format (RIF) dialect (currently we support the BLD dialect, plus
the NOT operator)

Rhizome is a Wiki-like content management and delivery system that
exposes the entire site including content, structure, and metadata
as editable RDF. This means that instead of creating a site with
URLs that correspond to a page of HTML, you can create URLs that
represent just about anything. It was designed to enable
non-technical users to create these representations in an easy,
ad-hoc manner. For developers, this allows both content and
structure to be easily repurposed and complex Web applications to
be rapidly developed

RiMOM (Risk Minimization based Ontology Mapping) integrates
different alignment strategies: edit-distance based strategy,
vector-similarity based strategy, path-similarity based strategy,
background-knowledge based strategy, and three
similarity-propagation based strategies.

Ripple is a scripting language expressed in RDF lists. Its scripts
both operate upon and are made up of RDF metadata, extending the
idea of HTTP. The Java implementation of Ripple includes a query
engine, a provisonal assortment of primitive functions, and an
interactive interpreter which parses commands and queries in a
readable, Turtle-like format

RKBGadgets give access to the Linked Data of the RKB Platform,
enabling the many knowledge bases and external Linked Data sources
to be conveniently searched and queried, with the results presented
in a consolidated view. Details of people and other artefacts of
academic research can be found, and the relationships between them
displayed.

ROC (Rapid Ontology Construction) is a tool that allows domain
experts to quickly build a basic vocabulary for their domain,
re-using existing terminology whenever possible. How this works is
that the ROC tool asks the domain expert for a set of keywords that
are 'core' terms of the domain, and then queries remote sources for
concepts matching those terms. These are then presented to the
user, who can select terms from the list, find relations to other
terms, and expand the set of terms and relations, iteratively. The
resulting vocabulary (or 'proto-ontology', basically a SKOS-like
thesaurus) can be used as is, or can be used as input for a
knowledge engineer to base a more comprehensive domain ontology on.

ROO (Rabbit to OWL Ontology Authoring) is an OWL ontology
construction tool based on Protege 4 that has been designed to be
used by users with limited or no ontology engineering experience.
ROO provides guidance about the steps involved and how to enter
knowledge using a controlled natural language

The Rule Manager allows business users to construct, animate,
validate and visualize business rules without writing any technical
code. It uses English-friendly constructions and does not require
semi-colons, quotes, or other technical constructions. It is based
on RuleML and uses .Net

Rx4RDF shields developers from the complexity of RDF by enabling
you to use familar XML technologies like XPath, XSLT and XUpdate to
query, transform and manipulate RDF. Also included is Rhizome, a
wiki-like application for viewing and editing RDF models

S3DB is a web service application with an explicit semantic data
model. Specifically, the core model, described in RDFS,
distinguishes the domain of discourse from its instantiation. This
distinction enables the explicit inclusion of a permission
inheritance component. As a consequence, the access to individual
data elements is resolved on the server side rather than at the
access point. Data formats supported include tabular text, HTML,
XML, XML/RDF, N3/RDF, and JSON. The S3DB code also includes a basic
web-based database management system application (DBMS). See also
http://code.google.com/p/s3db/

SAHA is a browser-based metadata editor and annotation tool for
creating data in RDF format for the semantic Web. Its main function
is to provide a distributed environment where multiple users can
simultaneously annotate and edit RDF data through a Web interface

Sahi is an automation and testing tool for web applications, with
the facility to record and playback scripts. Developed in java and
JavaScript, this tool uses simple JavaScript to execute events on
the browser. Can be used for Web wrapping or other semantic tasks

Samizdat is a generic RDF-based engine for building collaboration
and open publishing web sites. Samizdat provides users with means
to cooperate and coordinate on all kinds of activities, including
media activism, resource sharing, education and research

Schema-Cache is an online search engine for RDF vocabularies and
schemas that you can look up by schema name, properties and
classes. Results can be viewed as RDF/XML, JSON, or JSONP; SPARQL
and JSON Query interfaces are available

Scone is a high-performance, open-source knowledge-base (KB) system
intended for use as a component in many different software
applications. Scone supports simple inference over the elements and
statements in the knowledge base: inheritance of properties from
more general descriptions, following chains of transitive
relations, detection of type mismatches, and so on

The scones (Subject Concept Or Named EntitieS) tagger provides
information extraction of domain-specific subject concepts and
entities from unstructured text. It also provides disambiguation of
this information based on the context of the source information.
Its baseline configuration works off of the UMBEL ontology for
concepts and Wikipedia for entities, though supplements are
recommended for specific domain needs.

SDB is a persistent storage subsystem for Jena that is designed
around SPARQL. SDB is not a replacment for Jena's current database
layer which is more reliable and widely tested. SDB is an ARQ query
engine that also happens to support the Jena Graph API via
translation to SPARQL

SEAL (Set Expander for Any Language) uses set expansion to expand a
given partial set of objects into a more complete set. A well-known
example system that does set expansion using the web is Google
Sets. SEAL uses a novel method for expanding sets of named
entities. The approach can be applied to semi-structured documents
written in any markup language and in any human language.

Searchy is a metasearch engine that is able to integrate
information from a wide range of sources performings a semantic
translation into RDF. It has a distributed nature and is specially
suitable to integrate information across different organisations
with a minimun coupling

SecondString, an open-source Java-based package of approximate
string-matching techniques. SecondString is intended primarily for
researchers in information integration and other scientists. It
does or will include a range of string-matching methods from a
variety of communities, including statistics, artificial
intelligence, information retrieval, and databases. It also
includes tools for systematically evaluating performance on test
data. It is not designed for use on very large data sets

SegmentationApp is a Java application that segments a given
ontology according to the approach described in "Web Ontology
Segmentation: Analysis, Classification and Use"
(http://www.co-ode.org/resources/papers/seidenberg-www2006.pdf).

Semantic Bank is the server companion of Piggy Bank that lets you
persist, share and publish data collected by individuals, groups or
communities. In the past it was a standalone project but now comes
bundled with Longwell as a Longwell configuration.

The Semantic Data Extractor from W3C is an online tool, geared by
an XSLT stylesheet, that tries to extract some information from a
HTML semantic rich document. It only uses information available
through a good usage of the semantics defined in HTML. It is a bit
finicky and requires valid (X)HTML as the starting point.

The Semantic Discovery System (SDS) uses an ontology-driven
approach to query over federated data sources. It provides bridges
to legacy data sources such as Oracle, Excel and other proprietary
formats to make them accessible via SPARQL queries and related
semantic Web technologies.

Semantic Fire is a tool for generating RDF from websites. It can be
used to make it possible to query the data. It is a Web scraper
plug-in to Firefox that can make the data available via an API or
in downloadable format. It creates a simple yaml file that maps a
Web site's content to semantic Web ontologies.

Semantic LDAP consists of two contributing applications. The first
is LDAP 2 SPARQL (http://dl-learner.org/Projects/LDAP/Backend), is
a backend to the widely used OpenLDAP server. It translates LDAP
queries into SPARQL queries, asks a SPARQL endpoint and translates
the result back to LDIF. The second is LDAP 2 OWL
(http://purl.org/net/ldap/ldap2owl.php), which converts a directory
information tree (DIT) complete with schema information into an OWL
ontology with directory objects represented in RDF

Semantic MediaWiki is project incorporating a series of modules to
the MediaWiki software (powering Wikipedia), which extends the Wiki
with ideas from the Semantic Web. It focuses on usability and tight
integration.

Semantic Server, built by Semantic Soft, stores information in
multiple semantic repositories with strong meta-data management
features, offers features for collaborative development with
security and user management, publishes content from repositories
on the web, and has a visual SPARQL processor

Semantic Tools for Web Services is a set of Eclipse plug-ins that
allow developers to insert semantic annotations into a WSDL
document to describe the semantics of the input, output,
preconditions, and effects of service operations. A second plug-in
matches the description of the service or composition of services
to that for which a developer is searching. This technology is part
of the Emerging Technologies Toolkit (ETTK)

Semantic Turkey is a Firefox Extension which can be used to keep
track of relevant information from visited web sites and organize
collected content according to a personally defined ontology. Clear
separation between knowledge data (the WHAT) and web links (the
WHERE) is established into the knowledge model of the system, which
allows for innovative navigation of both the acquired information
and of the pages where it has been collected.

Semantic Web Client library represents the complete Semantic Web as
a single RDF graph, which can be queried using SPARQL. To answer
queries, the library dynamically retrieves information from the
Semantic Web by dereferencing HTTP URIs and by following rdfs. The
library is written in Java and is based on the Jena framework

Semantic Web Pipes is an online Ajax service that enables fetch,
mix and process RDF files published on the Web. The output of a
Pipe is an HTTP retrievable RDF model; thus, simple pipes can also
work as inputs to more complex Pipes. SW Pipes currently has 4 RDF
Pipes components (Fetch, Mix, SPARQL Transform and RDFS
materialization), with more planned.

Semantica® represents knowledge in the form of semantic networks.
Semantica incorporates a Geospatial Information System (GIS) and
supports native .KML output for integration into other GIS tools,
Wikimedia and related open-source tools

SemanticSTEP Viewer (SSV) is the user interface (frontend) to
access SRDB and work with ontologies. SSV is a web browser
application collected into a Java servlet on some web server. This
frontend uses HTML, XML Stylesheet Transformations (XSLT) and
JavaScript to provide the user interface.

Semaphore is an enterprise content intelligence platform. It is
built from four core and inter-connected modules: 1) an ontology
manager; 2) classification and text mining; 3) semantic server; and
4) a search appliance

Semantic Clipboard (SemClip) enables the exchange of semantically
enriched data between desktop applications and from Web pages. It
uses the clipboard to handle such formats as FOAF, RDFa, vCard,
hCard, etc. Currently only runs on the Mac OS

SemCocoon is an extended stand-alone distribution of Cocoon 2.1.3,
with added components that enable Web Services- and Semantic
Web-functionality (an embedded Ontogator and an interface to
Ontodella), along with other minor enhancements

SemFS is a semantic file system based on RDF. On Windows, it can be
mounted as a WebDAV drive. For Linux, SemFS can be used as a user
level file system via FUSE. SemFs currently supports tagging of
files and browsing according to various ontologies

SemLens is a visual tool that allows to arrange objects in a
scatter plot and to analyze them by user-defined semantic lenses.
The lenses can be independently defined for each of the objects'
properties and can be combined by logical operators. The scatter
plot provides a global overview and supports the discovery of
dependencies and correlations also in large datasets

SemMF is a flexible framework for calculating semantic similarity
between objects that are represented as arbitrary RDF graphs. The
framework allows taxonomic and non-taxonomic concept matching
techniques to be applied to selected object properties

SemSearch is a semantic search engine, which is designed for naïve
users, i.e., ordinary end users who are not necessarily familiar
with domain specific semantic data, ontologies, or SQL-like query
Languages

semweb4j offers a unique framework to create semantic web
applications with Java fast and type-safe. It consists of three
components: RDF2GO is a wrapper around most available triple
stores; RDFReactor allows you to program with an object-oriented
Java API; and jREST puts your Java objects on the (semantic) web

The Semantic Web Integrator and Query Engine (SemWIQ) can be used
to integrate information from heterogeneous, distributed
information systems such as relational databases, spreadsheets,
etc. with Semantic Web technology.

SERF is a generic infrastructure for Entity Resolution (ER) (also
known as deduplication, or record linkage). ER is an important
information integration problem for resolving "real-world entities"
(e.g., customers, or products) that are referred to in different
ways in multiple data records.

Sesame is an open source RDF database with support for RDF Schema
inferencing and querying. It offers a large scale of tools to
developers to leverage the power of RDF and RDF Schema. It was
recently updated to Sesame 2.0

SETH is a software effort to deeply integrate Python with Web
Ontology Language (OWL-DL dialect). The idea is to import
ontologies directly into the programming context so that its
classes are usable alongside standard Python classes.

Sewese is a platform built on the CORESE engine. This framework
provides a set of functionalities like generation of interfaces for
requests, edition and navigation, and management of the transverse
functions of a portal (presentation, internationalization,
security, ...). An ontology editor, a generic annotation editor and
a basic rule editor are parts of the Sewese platform

Sgvizler is a javascript which renders the result of SPARQL SELECT
queries into charts or HTML elements. Almost all major chart types
offered by the Google Visualization API are supported plus a few
more are added: Line Chart, Area Chart, Column Chart, Bar Chart,
Scatter Chart, Sparkline, Pie Chart, Candlestick Chart, Gauge, Org
Chart, Tree Map, Motion Chart, Timeline, Geo Chart, Geo Map, Map,
Map+, Table, List, Definition List, Text. The SPARQL query may be
given in a textarea, similar to that of SPARQLer and Snorql, or put
in a div html element, similar to Spark

SHAME is a metadata editing and presentation framework for RDF
metadata. Annotation profiles are then used to generate User
Interfaces (semantic) for either editing, presentation or querying
purposes. The user interface may be realized in a web setting (both
a jsp and velocity version exists) or in a stand alone application
(a java/swing version exists)

Sidewinder is a new kind of application framework that uses
standard web languages such as XHTML, SVG, XForms and Ajax, in
place of more complex languages such as C++, C# or Java, to build
complete, internet-facing, desktop applications.

SIE (Simple Information Extraction) is an information extraction
system based on a supervised machine learning technique for
extracting implicit relations from documents. In particular,
Information Extraction (IE) is cast as a classification problem by
applying Support Vector Machines (SVMs) to build a set of
classifiers for detecting the boundaries of the entities to be
extracted.

Sifter is an experimental Firefox extension that is the first
end-to-end solution that integrates a web data extraction algorithm
with a user interface for automatically filtering and sorting
structured data from Web sites.

The Silk famework is a tool for discovering relationships between
data items within different Linked Data sources. Data publishers
can use Silk to set RDF links from their data sources to other data
sources on the Web.

SimMetrics is an open source extensible library of Similarity or
Distance Metrics, e.g. Levenshtein Distance, L2 Distance, Cosine
Similarity, Jaccard Similarity etc etc. SimMetrics provides a
library of float based similarity measures between String Data as
well as the typical unnormalised metric output. It is intended for
researchers in information integration and related fields. It
includes a range of similarity measures from a variety of
communities, including statistics, DNA analysis, artificial
intelligence, information retrieval, and databases.

Sindice is a simple lookup index for Semantic Web resources that
use RDF. Sindice indexes the Semantic Web and can tell you which
sources mention a certain resource URI. Besides the online service,
it is possible to download the open source code at
https://launchpad.net/sindice

The SIOC Exporter for WordPress is a plugin that supports creation
of SIOC (Semantically-Interlinked Online Communities), one of the
standard RDF ontologies. It works with all WordPress versions above
1.5

sitemap4rdf is a command-line tool that generates sitemap.xml
files, that follow the sitemap protocol, for Linked Data sites that
have a SPARQL endpoint. The Sitemap protocol is supported by the
major search engines to ensure quick and complete indexing of
websites. Use of the Sitemap protocol is also recommended by
Sindice.

SMAD (Semantic Mobile Application Development) is a catalyst for
mobile Web applications that incorporate semantic data into
location based services. The emphasis is on data about the location
and points of sales and services (shops, etc.)

SmartLink (SeMantic Annotation enviRonmenT for Linked Services) is
an online Web application to aid users in the creation and browsing
of Linked Services, which are semantic API/service annotations
following Linked Data principles. SmartLink provides an interface
to populate and navigate through Linked Services repositories) and
a set of APIs to further exploit the descriptions provided with
SmartLink

S-Match is a semantic matching framework that takes any two tree
like structures (such as database schemas, classifications,
lightweight ontologies) and returns a set of correspondences
between those tree nodes which semantically correspond to one
another. It has a choice of four or so matching algorithms and a
graphical interface for seeing the putative matches.

SMOB is a Semantic Microblogging client and server application.
SMOB is entirely based on Semantic Web and Linked Data
technologies. Each hub locally stores its data as native RDF (using
ARC2, also providing a SPARQL endpoint per hub) and the
communication between hubs is provided via SPARQL/Update over HTTP.
Each hub provides RDFa information about itself and the microblog
posts it contains, using SIOC, FOAF and OPO as well as interlinking
with MOAT and CommonTag.

Snoggle is a graphical, SWRL-based ontology mapper. Snoggle
attempts to solve the ontology mapping problem by providing a
graphical user interface (similar to which of the Microsoft Visio)
to guide the process of ontology vocabulary alignment. In Snoggle,
user-defined mappings can be serialized into rules, which is
expressed using SWRL.

Snoggle is a graphical, ontology mapper based on the Semantic Web
Rule Language (SWRL) and assists in aligning ontologies represented
in the Web Ontology Language. It allows users to visualize
ontologies and then draw mappings between them on an intuitive
graphical canvas. Users draw mappings as appropriate, and the tool
then transforms them into SWRL/RDF or SWRL/XML for use in a
knowledge base.

Snow Owl is a state-of-the-art authoring platform for clinical
terminologies (though can be used with any ontology). It supports
the extended SNOMED CT compositional grammar for semantic querying,
and allows the creation of intensional and extensional value sets
among other reference set types that follow the IHTSDO SNOMED CT
RF2 specifications. In addition to authoring SNOMED CT or a
national extension, Snow Owl is designed to support a variety of
ontologies and classifications. Snow Owl is designed to be a
framework to support higher-level terminology authoring tools, such
as drug and device dictionaries. Snow Owl is built upon the Eclipse
platform. Snow Owl can operate in two modes; in stand-alone mode
supporting a single user or in collaborative mode

SOFA is a Java API for modeling ontologies and Knowledge Bases in
ontology and Semantic Web applications. It provides a simple,
abstract and language neutral ontology object model, inferencing
mechanism and representation of the model with OWL, DAML+OIL and
RDFS languages; from java.dev

SONIVIS is an Eclipse-based open source software for analysing and
visualizing virtual information spaces such as Wiki, Weblogs, or
Social Network Sites. The objective is to investigate different
development stages of these information spaces and therefore to
recommend appropriate knowledge management activities depending on
each stage.

Soprano (formerly known as QRDF) is a library which provides a
highly usable object-oriented C++/QT4 framework for RDF data. It
uses different RDF storage solutions as backends through a simple
plugin system. Soprano is targetted at desktop applications that
need a RDF data storage solution. It has been optimized for easy
usage and simplicity.

Sparallax is a faceted browsing interface for SPARQL endpoints,
based on the Freebase Parallax applicatioin. Sparallax works with a
MQL to SPARQL two way translator (Request and Reply) which sits
between Parallax and the SPARQL endpoint. Sparallax allows
customizations on how to visualize a Semantic Web dataset based on
simple configuration files.

SPARCool offers a way to run SPARQL queries about any URI that
follows the Linked Data principles by calling a simple URL based on
the http://sparcool.net/format/predicate[;l=lang]/URI pattern. It
supports XML, JSON and HTML formats.

Spark is a fully HTML5 compliant JavaScript library that allows
results from SPARQL queries to be included in any website (as a
"spark"), or, by using QCrumb to use any RDF data. Spark is an
extensible mechanism to define renderers for SPARQL result sets

SPARQL is a query language for getting information from such RDF
graphs. It provides facilities to extract information in the form
of URIs, blank nodes and literals; extract RDF subgraphs; and
construct new RDF graphs based on information in the queried graphs

SPARQL queries are not confined by datasource boundries. Queries
over distributed data often entail querying one source and using
the acquired information to constrain queries of the next source.
SPARQLfed bundles an intermediate result set with a SPARQL query,
allowing the remote engine to locally join its data against the
current constraints.

SparqlMotion is a visual language from TopQuadrant that enables
average users to define scripts that import, post-process, query
and visualize data using semantic Web technology. Users can define
and share those scripts as OWL/RDF models, based on a dedicated
SparqlMotion ontology and module library. An ontology editor can be
used to define the data and execution flow of these scripts using
drag and drop. Scripts can be executed within Composer or on the
Web using the TopBraid Live platform.

The SparqlTrainer is an e-learning tool to practice SPARQL
interactively. With the help of self-developed tests, consisting of
individual questions, it is possible to train the different query
forms (SELECT, CONSTRUCT, ASK and DESCRIBE).

SPARQLTree turns SPARQL results into more manageable trees of data
using a simple convention on variable names, which are grouped
together into "condensed" object trees, suitable for common ("OO")
programming languages to handle efficiently (think JSON). The
current implementation can be used programmatically to convert
regular results on the fly into "idiomatic" JSON

SparqlXMPP is a messaging framework to transport SPARQL queries to
various endpoints, querying the RDF data stored at those endpoints
using SPARQL, and then receving data in the standardized SPARQL XML
query results format. It uses XMPP, the protocol behind the more
commonly known Jabber chat services, that is an XML based
communications format with significant support for extensibility.
SparqlXMPP uses the jabberpy jabber library, client/server software
written in Python, and uses Redland for RDF processing and storage.

Sparta is an Python API for RDF that is designed to help easily
learn and navigate the Semantic Web programmatically. Unlike other
RDF interfaces, which are generally triple-based, Sparta binds RDF
nodes to Python objects and RDF arcs to attributes of those Python
objects

SPASQL is experimental effort to add SPARQL support to MySQL. It is
based on query translation work between SPARQL and SQL, where there
is a high degree of conformance. A SPASQL query can be seen as a
SPARQL query with only a query string. SPASQL does not support
CONSTRUCT or DESCRIBE at this time

SquirrelRDF is a tool which allows non-RDF data stores (or,
perhaps, not explicitly RDF) to be queried using SPARQL. In its
current form this includes relational databases (via JDBC) and LDAP
servers (via JNDI). It provides an ARQ QueryEngine (for java
access), a command line tool, and a servlet for SPARQL http access.
As a result the information now looks like RDF, and is always
current.

Apache Stanbol is an Open Source HTTP service meant to help Content
Management System developers to semi-automatically enhance
unstructured content (text, image, ...) with semantic annotations
to be able to link documents with related entities and topics

The Stanbol Ontology Manager provides a controlled environment for
managing ontologies, ontology networks and user sessions for
semantic data modeled after them. It provides full access to
ontologies stored into the Stanbol persistence layer. Managing an
ontology network means that you can activate or deactivate parts of
a complex model from time to time, so that your data can be viewed
and classified under different "logical lenses". This is done via
the OntoNet component of the manager

Strata from Kirix is a specialty browser for accessing and
manipulating Web data obtained from tables, CSV files, RSS feeds,
MySQL, Oracle and other databases. Strata incorporates the Mozilla
Foundation's Gecko layout engine. Data can be stored and
manipulated locally with flexible report creation. Strata comes
with a data-enabled version of ECMAScript (e.g., JavaScript) with
built-in support for SQL.

StrixDB provides a RDF graph store, SPARQL and SPARQL/Update
compliant, with Datalog inference capabilities. It could be used as
a SPARQL server with Apache HTTP Server or standalone as a regular
Windows DLL or as a Lua module.

The Structured Data Linter digests structured data, in the form of
HTML marked-up with RDFa or Microdata, or other RDF technologies
supported in Linked Data. Output is expressed as HTML+RDFa in a
Snippet format. Formats supported include N-Triples, RDF/XML,
Turtle and Notation3, RDFa, RDF/JSON, TriX, Microdata and JSON-LD

structWSF is a platform-independent Web services framework for
accessing and exposing structured RDF data. Its central organizing
perspective is that of the dataset. The structWSF framework is
RESTful in design and is based on HTTP and Web protocols and open
standards. The initial structWSF framework comes packaged with a
baseline set of about a dozen Web services in CRUD, browse, search
and export and import. All Web services are exposed via APIs and
SPARQL endpoints.

SuRF is an Object Oriented - RDF Mapper based on the popular rdflib
python library. It exposes the RDF triple sets as sets of resources
and seamlessly integrates them into the Object Oriented paradigm of
python in a similar manner as ActiveRDF does for ruby

Surnia can check an OWL ontology/knowledge base for inconsistency
and entailments. It is implemented as a wrapper around first-order
theorem prover (OTTER, for now at least). Unlike Hoolet (which
turns the OWL into FOL), Surnia just turns the OWL into triples and
mixes in axioms

SVMlight is an implementation of Vapnik's Support Vector Machine
[Vapnik, 1995] for the problems of pattern recognition, regression,
and learning a ranking function. SVM is often a key component in
various NLP tools (among others)

SWAML, pronounced [swæml], reads a collection of email messages
stored in a mailbox (from a mailing list compatible with RFC 4155)
and generates a RDF description. It is written in Python using SIOC
as the main ontology to represent in RDF a mailing list

Swangling is a method for using the text search capabilities on the
web for semantic searches. The Swangler creates annotated documents
from RDF-based documents that can be indexed and then searched for
on the web based on their semantic content

A semantic Web processor using Lisp. Axioms and entailment rules in
RDF/S are embodied in the system so that a lisp programmer can
codify ontology in RDF/S and use the ontology within the semantics
specified by RDF/S documents

SWEET is a tool for users to create semantic RESTful services by
structuring service descriptions and associating semantic
annotations. SWEET aims to support a higher level of automation
when performing common tasks with RESTful services, such as their
discovery and composition. For demo, see
http://sweetdemo.kmi.open.ac.uk/soa4all/MicroWSMOeditor.html

SweetWiki (Semantic WEb Enabled Technology Wiki) is a semantic wiki
based on the CORESE engine. It only supports the wysiwyg edition of
pages and annotations, and uses the CORESE engine for all
operations : navigation, search, etc.

Swignition (formerly known as Cognition) is a parser for both
“upper case Semantic Web” (RDF, RDFa) and “lower case semantic web”
(microformats) technologies. It includes modules for exporting
parsed data in a variety of formats, including RDF, vCard,
iCalendar, Atom and KML

SWObjects is a library that provides general support for semantic
Web applications, including SPARQL queries, SPARQL updates, Rdf2RDB
(access to relational databases) and SWxform (rule-based query and
data transformation)

Swoogle is the definitive semantic Web search engine with 1.5 M
resources and 10,000 ontologies listed; it only indexes some
metadata about Semantic Web documents. It neither stores nor
searches all triples in an Semantic Web documents as a triple store

SWRL-IQ is a plugin for Protege 3.x that allows users to edit,
save, and submit queries to an underlying inference engine based on
XSB Prolog. The tool features backward- and forward-chaining
reasoning; constraint-based solving; debugging; and saving and
exports. It also supports SWRL extensions and attached Java
procedures

SWSE is a Semantic Web Search Engine (pronounced "swizzy" or
"swishy", the prjoject states both!). SWSE uses many existing
vocabularies and ontologies to apply powerful inferencing
techniques for extended search queries. An online prototype is
available at http://swse.deri.org/

Sztakipedia is an experimental editing assistant for Wikipedia (and
for other web-based systems). The project focuses on enabling
everyday users to create documents with the help of text mining,
intelligent search, linked data and machine learning.

Tab2OwlClass is a Java tool for importing classes into an already
existing OWL file. The script uses the OWL-API to read in a tab
delimited file of class details and create OWL classes from these
rows, adding them to an existing ontology

Tal4Rdf (T4R) is a template system for RDF that can produce
HTML/XHTML (any XML in fact). T4R is based on Zope’s Template
Attribute Language and is well suited to produce HTML or any
XML-based format or non-XML formats. Besides being available as a
software library, it can also be used through an HTTP-based service

The Talis Platform is an online environment for building next
generation applications and services based on Semantic Web
technologies. It is a hosted system which provides an efficient,
robust storage infrastructure. Both arbitrary documents and
RDF-based semantic content are supported, with sophisticated query,
indexing and search features. While not open source, most
components are built on open source and no cost access may be
provided to certain developers

Telescope maps resources from RDF graphs to Python objects. Its
design is heavily influenced by SQLAlchemy. It has three main
components: 1) a programmatic, generative SPARQL query builder; 2)
a configurable mapper for binding resources to existing Python
domain objects; and 3) a declarative syntax for creating mapped
classes.

The TELOS OWL-DL graphic editor is Web-based, available on any
platform through a Web browser and has export/import capabilities
to standard XML OWL-DL files. This editor provides complete graphic
edition to facilitate the inception phase and the communication of
ontologies.

Online and batch term extractor that emphasizes part of speech
(POS) and n-gram (phrase extraction). TerMine is the terminological
management system with the C-Value term extraction and AcroMine
acronym recognition integrated

Text2Onto is a framework for ontology learning from textual
resources that extends and re-engineers an earlier framework
developed by the same group (TextToOnto). Text2Onto offers three
main features: it represents the learned knowledge at a metalevel
by instantiating the modelling primitives of a Probabilistic
Ontology Model (POM), thus remaining independent from a specific
target language while allowing the translation of the instantiated
primitives into knowledge representation mechanisms such as RDFS,
OWL, and F-Logic

TextGarden is a suite of text-mining software tools to enable easy
handling of text documents for the purpose of data analysis
including automatic model generation and document classification,
document clustering, document visualization, dealing with Web
documents, crawling the Web and many others.

tFacet applies known interaction concepts to allow hierarchical
faceted exploration of RDF data. The aim is to facilitate ordinary
users to formulate semantically unambiguous queries so as to
support the fast and precise access to information. Used
interaction concepts are e.g. a directory tree and interchangeable
columns that are already well-known from other applications

Thea2 is a Prolog library that provides complete support for
querying and processing OWL 2 ontologies directly from within
Prolog programs. Thea2 also offers additional capabilities
including a bridge to the Java OWL API and translation of
ontologies to Description Logic programs

Thema is an XML based data format (DTD) for thesauri, glossaries,
lexicons, conceptual maps etc. up to ontologies. It contains
publishing tools to convert into HTML, RDF etc. and to read
different formats and is has a connection to the Semantic Web

ThManager is a tool for creating and visualizing SKOS RDF
vocabularies. ThManager facilitates the management of thesauri and
other types of controlled vocabularies, such as taxonomies or
classification schemes.

ThoughtTreasure is a comprehensive platform for natural language
processing and commonsense reasoning. It runs on PCs and Unix and
includes 20,000 concepts organized into a hierarchy, 50,000 English
and French words and phrases, a syntactic and semantic parser and
an English and French generator. Application programs can use
ThoughtTreasure to obtain answers to questions easily answered by
humans but previously difficult for computers

tOKo is an open source tool for text analysis and browsing a corpus
of documents. It implements a wide variety of text analysis and
browsing functions in an interactive user interface. An important
application area of tOKo is ontology development. It supports both
ontology construction from a corpus, as well as relating the
ontology back to a corpus (for example by highlighting concepts
from the ontology in a document).

Top Engine is a business rule engine that uses the Web Ontology
Language (OWL DL) as a modeling language for specifying the Domain
Model of the Business Application. Top Engine allows you to write
rules on top of ontologies using a fusion between Description Logic
and Logic Programs formalisms.

Top Quandrant's TopBraid Composer is a complete standards-based
platform for developing, testing and maintaining Semantic Web
applications based on Eclipse There is now a limited functionality
free edition.

TopBraid Ensemble (TBE) uses pre-configured and customizable
application templates to navigate through semantic models and
corresponding data, using trees, grids and search form components,
including editable forms. They can also include maps and graph
visualization and query builder components

Topbraid Enterprise Vocabulary Net (EVN) is a Web-based solution
for simplified development and management of interconnected
controlled vocabularies. It supports users who need to collaborate
on defining and linking enterprise vocabularies, taxonomies,
thesauri and ontologies used for information integration,
customization and search. It includes vocabulary editing, search,
structure definition and import, and is based on SKOS. It is part
of the the commercial TopBraid Composer.

Topicalizer is a service which automatically analyses a document
specified by a URL or a plain text regarding its word, phrase and
text structure. It provides a variety of useful information on a
given text including the following: Word, sentence and paragraph
count, collocations, syllable structure, lexical density, keywords,
readability and a short abstract on what the given text is about.

Topincs is a topic map authoring software that allows groups to
share their knowledge over the web. It makes use of a variety of
modern technologies. The most important are Topic Maps, REST and
Ajax. It consists of three components: the Wiki, the Editor, and
the Server. The servier requires AMP; the Editor and Wiki are based
on browser plug-ins.

SPIN is a collection of RDF vocabularies enabling the use of SPARQL
to define constraints and inference rules on Semantic Web models.
SPIN also provides meta-modeling capabilities that allow users to
define their own SPARQL functions and query templates. Finally,
SPIN includes a ready to use library of common functions.

Total Validator is a free one-stop all-in-one validator comprising
a HTML validator, an accessibility validator, a spelling validator,
a broken links validator, and the ability to take screenshots with
different browsers to see what your web pages really look like. It
also includes an XHTML+RDFa validator

TouchGraph is a set of interfaces for Graph Visualization using
spring-layout and focus+context techniques. Current applications
include a utility for organizing links, a visual Wiki Browser, and
a Google Graph Browser which uses the Google API

The TPTP (Thousands of Problems for Theorem Provers) is a library
of test problems for automated theorem proving (ATP) systems. The
TPTP supplies the ATP community with comprehensive libraries,
lists, utilities, guidelines and standards. These are useful for
OWL reasoners

tRDF4Jena extends the Jena framework with classes to represent,
determine, and manage trust values that represent the
trustworthiness of RDF statements and RDF graphs. Furthermore,
tRDF4Jena contains a query engine for tSPARQL, a trust-aware
extension to the query language SPARQL.

tRDF4JenaBench contains tools to benchmark the tSPARQL engine and
the trust functions in tRDF4Jena. The benchmark is based on the
Berlin SPARQL Benchmark (BSBM). The tRDF4Jena benchmark uses
benchmark data generated for BSBM as well as supplementary,
self-generated data. This supplementary data comprises assertions
of trust assessments about the BSBM data.

Treebolic is a Java component (widget) whose purpose is to provide
a hyperbolic rendering of hierarchical data. A tree is rendered
with nodes and edges but display space is subject to a particular
curvature (hence the name) : more space is allocated to the focus
node while the parent and children, still in the immediate visual
context, appear slightly smaller. The grandparents and
grandchildren are still visible but come out even smaller. As one
moves away from the focus node, less display space is allotted to
the nodes, which gradually disappear towards the disk's border.
Nodes may then contain hypertext links and the browser to other web
pages. The tree is dynamic (animation brings the focus node to the
center) and responds to user interaction.

Trellis is an interactive environment that allows users to add
their observations, viewpoints, and conclusions as they analyze
information by making semantic annotations to documents and other
on-line resources

The Trainable Relation Extraction framework has been developed as a
testbed for experimenting with several extraction algorithms and
scenarios. The framework aims to be general enough to support a
variety of entity extraction and relation extraction algorithms
from several input formats.

The Transparent RDF Inference Engine is an innovative graphical
debugger for rules - it works but is limited to RDF files without
anonymous nodes and rules without functors or built-ins. The
project is created in Java, is build on top of Eclipse and utilizes
the Jena framework

TripleSoup will be an RDF store, tooling to work with that
database, and a REST web interface to talk to that database using
SPARQL, implemented as an Apache webserver module; in development
as a proposal to the Apache project

Triplipse is a visualization tool for the Resource Description
Framework (RDF) within Eclipse. It provides two views of the RDF
Graph: one as a simple, sortable table of the Statements, and one
as a tree instead of the typical bubble-graph. Triplipse accepts
RDF+XML and Turtle formats, and is designed to be easily extended
to provide more formats in the future. The primary goal of
Triplipse is to provide developers with an easy-to-use
visualization tool for a large RDF Document without having to
scroll through pages and pages of RDF text and without having to
scroll through screens and screens of hard to read and hard to
connect images

Tripliser is a Java library and command-line tool for creating
triple graphs, and RDF serialisations, from XML source data. It is
particularly suitable for data that is messy, bulky or volative. It
is designed as an alternative to XSLT conversion

Triplr is a service that generates RDF triples from URIs. This
service helps convert formats as need be and lets you access
embedded RDF via GRDDL (such as microformats when they have a
transformation) without the need to pre-process it yourself.

TrueKnowledge combines natural language analysis, an internal
knowledge base and external databases to offer immediate answers to
various questions. Instead of just pointing to web pages, it offers
explicit answers and explains the reasoning behind them. Users may
add to the knowledge base through a simple sequence of steps,
contradict or correct existing knowledge. There is also a query
language and API for access by external services.

tSPARQL is a trust-aware query language that has been developed
within the tRDF project. tSPARQL extends the RDF query language
SPARQL to query the trustworthiness of RDF data; it enables users
to describe trust requirements in a declarative manner and to
access the trust values associated to query solutions.

Tulip is designed for the information visualization of graphs
having more than 1,000,000 elements. Tulip draws and displays huge
graphs, allows the navigation through geometric operations as well
as the extraction of subgraphs and the enhancement of the results
obtained by filtering. There are many graph options and plugins.

Twarql encodes information from microblog posts as Linked Open Data
in order to enable flexibility for those interested in collectively
analyzing microblog data for sensemaking. The system extracts
content from Twitter and encodes that content in RDF using
vocabularies such as FOAF, SIOC, MOAT, etc.

txt2re acts as a regular expression generator. Instead of trying to
build the regular expression, you start off with the string that
you want to search and paste it into the site. Upon submit the site
finds recognisable patterns in the string and can generate
expressions usable in multiple languages

Uberblic imports structured data from sources like Wikipedia,
Geonames, DBpedia, and the BBC, and integrates those sources into a
single, coherent data set. It provides a single point of access to
integrated web data.

Ubiquity is an experiment into connecting the Web with language in
an attempt to find new User Interfaces (semantic) that make it
possible for everyone to do common Web tasks more quickly and
easily. It's a Firefox extension. Many of the contributed scripts
(http://ubiquity.mozilla.com/herd/) are related to semantic Web or
information extraction

The Ubiquity RDFa project provides a JavaScript RDFa parser that
can be used either on its own, or with other parts of the Ubiquity
library. The library is embedded into an HTML or XHTML document,
and any RDFa is parsed and stored in a simple triple store. The
store can be queried with a subset of SPARQL, expressed in JSON
syntax, called jSPARQL.

This application can be used for debugging Linked Data sites as an
alternative to the command line tool curl. This application allows
you to dereference URIs and it visualizes the HTTP response of the
server. In contrast to curl, you may directly select each URI that
occurs in the response in order to initiate the dereferencing of
the selected URI with this application. Hence, using this tool you
may avoid the cumbersome copying and pasting of URIs on the command
line as is necessary with curl.

URIBurner is a simple but power service that delivers RDF based
structured descriptions of Web addressable resources (documents or
real world objects) in a variety of formats, including but not
limited to RDF/XML, Turtle, N3. It supports hand-written or
query-by-example SPARQL querying over linked data

Vapour (VAlidation Process Of Uri Response) is a web-based
validator tool to check the compliance of a web site against the
recipes described in the Best Practices Recipes for Publishing RDF
Vocabularies (W3C Working Draft).

ve2 is a Web application that enables you to generate a voiD file
(voiD is a vocabulary to describe Linked Data sets, their
interlinking with other datasets, technical features, etc.) using a
number of forms to define the categories, interlinking, technical
features, licensing, etc. of your vocabulary in RDF Turtle format.

Versa is a query language designed for the specific purpose of
extracting information from an RDF graph in a very modular way. A
Versa query facilitates the isolation of resources, and their
associated property values through specific patterns and
constraints as specified by a Versa expression (somewhat
reminiscent of XPath expressions and regular expressions)

VINE is a tool that allows users to perform fast mappings of terms
across ontologies. It performs smart searches, can search using
regular expressions, requires a minimum number of clicks to perform
mappings, can be plugged into arbitrary mapping framework, is
non-intrusive with mappings stored in an external file, has export
to text files, and adds metadata to any mapping. See also
http://sourceforge.net/projects/vine/

OpenLink's Virtuoso is an innovative universal server platform that
delivers an enterprise-level data integration and management
solution for SQL, RDF, XML, Web services, and business processes.
For RDF, it supports SPARQL, helpful extensions such as update,
inference tables and RDFizers.

VisiNav is a system to search and navigate web data. The conceptual
model consists of objects (for example, a person). Objects can have
attributes (a person has a name). Objects can also have connections
to other objects (a person knows other people).

VisKo is a framework supporting the execution of declarative
visualization queries. The premise of this work is that
visualization users and developers can more easily specify for the
generation of visualizations by writing queries rather than
programming custom visualization pipelines . VisKo is composed of
the four components: ontologies; its query language; its knowledge
base; and reasoning

Visual Browser is a Java application that can visualise the data in
RDF scheme. The main principle of the visualisation is that the
triple (resource, resource, resource) is represented by two nodes
connected by an edge and is represented by a hint (small window
appearing on mouse over the subject node). Visual Browser uses the
Jena framework to obtain the data. The visualisation engine is
derived from TouchGraph LLC.

Vitro is a general-purpose web-based ontology and instance editor
with customizable public browsing. Vitro is a Java web application
that runs in a Tomcat servlet container. With Vitro, you can: 1)
Create or load ontologies in OWL format; 2) Edit instances and
relationships; 3) Build a public web site to display your data; and
4) Search your data with Lucene.

VIVID (Visual Variable-Depth Info Display) is a browser-based
viewing and publishing of complex OWL and RDF network structures
cast into a densely-packed presentation format. Viewers can
dynamically change the presentation by use of pivoting and an
interactive filtering mechanism.

ViziQuer is an application that connects to a SPARQL endpoint and
enables simple graphical queries to be constructed for retrieving
data. ViziQuer also allows exploration of SPARQL endpoint data
schema. It is designed to easily define restricted data set for
further analysis or just data browsing

voc2rdf is a Web-based tool for converting flat-file controlled
vocabulary documents containing tabular data into RDF/XML documents
that use classes and terms. The tool allows users to provide a
delimited file, a basic set of metadata, and produce a RDF/XML
document that can be imported into the MMI Ontology Registry and
Repository (ORR) and/or downloaded for use on a local computer.

VocBench is a Web-based, multilingual, vocabulary editing and
workflow tool. It transforms thesauri, authority lists and
glossaries into SKOS/RDF concept schemes for use in a linked data
environment. Originally created to manage AGROVOC, a multi-lingual
thesaurus with 40000 concepts in over 20 languages, it now hosts an
ever-expanding set of vocabularies. The environment can also be
downloaded separately

Semantic blogging software; w.bloggar is an application that acts
as an interface between the user and one or more blog(s); in other
words, it is a Post and Template editor, with several features and
resources that the browser based blog editors do not offer

Wandora is a Topic Maps engine written in Java with support for
both in-memory topic maps and persisting topic maps in MySQL and
SQL Server. It also contains an editor and a publishing system, and
has support for automatic classification. It can read OBO, RDF(S),
and many other formats, and can export topic maps to various graph
formats. There is also a web-based topic maps browser, and
graphical visualization.

Watson is a gateway for the Semantic Web. It plays three main
roles: 1- it collects the available semantic content on the Web, 2-
analyzes it to extract useful metadata and indexes, and 3-
implements efficient query facilities to access the data. (Still in
pre-release)

WebCAT is an extensible tool to extract meta-data and generate RDF
descriptions from existing Web documents. Implemented in Java, it
provides a set of APIs (Application Programming Interfaces) that
allow one to analyse text documents from the Web without having to
write complicated parsers

webExtractor is a Java application that is used for extracting
specific content from web based HTML, XML, CSV, and free form text.
The extracted data can be used for data gathering and mining
purposes; not updated since 2003-2004

Web-Harvest is Open Source Web Data Extraction tool written in
Java. It offers a way to collect desired Web pages and extract
useful data from them. In order to do that, it leverages well
established techniques and technologies for text/xml manipulation
such as XSLT, XQuery and regular expressions

Weka is a collection of machine learning algorithms for data mining
tasks. The algorithms can either be applied directly to a dataset
or called from your own Java code. Weka contains tools for data
pre-processing, classification, regression, clustering, association
rules, and visualization

WGFA (Web Gateway for Fact Assessment) is a web application to
create and manage W3C-OWL based ontologies, index websites, extract
XML-RDF or Dublin-Core metadata and provide search and query
operations on the websites based on the created semantic webs

WHIRL (Word-based Information Representation Language) is a
representation system that combines some of the properties of
relational databases, and some of the properties of statistical
ranked-retrieval systems.

Wikify! is a system to automatically "wikify" a text by adding
Wikipedia-like tags throughout the document. The system extracts
keywords and then disambiguates and matches them to their
corresponding Wikipedia definition.

The IBM Web Ontology Manager (WOM) is a lightweight, J2EE Web-based
system for managing Web Ontology Language (OWL) ontologies. It
enables developers to browse or search the ontologies registered
with the system by class or property names. In addition, they can
submit a new ontology file

Wraf (Web resource application framework) implements a RDF API that
hopes to realize the Semantic Web. The framework uses RDF for data,
user interface, modules and object methods. It uses interfaces to
other sources in order to integrate all data in one enviroment,
regardless of storage (LARGELY INACTIVE)

Wrangler is an interactive tool for data cleaning and
transformation. It allows transformation of messy, real-world data
into structured data tables. It has many export options, including
Excel, R, Tableau and Protovis

The Web Service Modeling Toolkit (WSMT) is a collection of tools
for use with the Web Service Modeling Ontology (WSMO), the Web
Service Modeling Language (WSML) and the Web Service Execution
Environment (WSMX)

The WSO2 Mashup Server is an open source mashup platform that hosts
JavaScript based mashups. It is based on Apache Axis2 and other
open source projects, and provides JavaScript authors the ability
to consume, compose and emit web services, feeds, scraped web
pages, email, and instant messages

The Mechanize Ruby library is used for automating interaction with
websites. Mechanize automatically stores and sends cookies, follows
redirects, can follow links, and submit forms. Form fields can be
populated and submitted. Mechanize also keeps track of the sites
that you have visited as a history

XLWrap is a spreadsheet-to-RDF wrapper which is capable of
transforming spreadsheets to arbitrary RDF graphs based on a
mapping specification. It supports Microsoft Excel and OpenDocument
spreadsheets such as comma- (and tab-) separated value (CSV) files
and it can load local files or download remote files via HTTP. The
bundled XLWrap-Server provides a public SPARQL endpoint

XML2OWL is an ANTLR-based program written in C++ for automatic
conversion of an Extensible Markup Language (XML) file to a Web
Ontology Language (OWL) file. The mapping rules are defined in a
text file with a .rules extension. Note that the OWL file created
during the conversion process contains only instance data (i.e.,
OWL individuals and their properties); the converter presumes the
existence of an OWL ontology that specifies the classes and
properties instantiated in the OWL file

Up to now, most ontologies are created manually, which is very
time-expensive. The goal is it, to produce ontologies automatically
via XSLT, which fit as good as possible to a given XML-file resp.
XML-Schema-file

XML2RDF maps XML instance data to RDF. It is part of the ReDeFer
project that employs a “XML Semantic Reuse Methodology” that
combines an XML Schema to OWL web ontology mapping, called XSD2OWL,
with XML2RDF. The ontologies generated by XSD2OWL are used during
the XML to RDF step in order to generate RDF data enriched by the
XML Schema semantics made explicit.

xOperator combines advantages of social network websites with
instant messaging. It is a semantic agent for xmpp / jabber network
which finds and shares content about resources (using RDF/SPARQL).
xOperator communicates via XMPP (the Jabber protocol) with the end
user and with other xOperators. By logging into the account of the
end user, xOperator is able to add itself to his roster (buddy
list) and check if the contacts of the users also run xOperator or
a compatible client.

XSD2OWL maps XML Schemas to OWL web ontologies. It is part of the
ReDeFer project that employs a “XML Semantic Reuse Methodology”
that combines XSD2OWL with a mapping from XML instance data to RDF,
XML2RDF. The ontologies generated by XSD2OWL are used during the
XML to RDF step in order to generate RDF data enriched by the XML
Schema semantics made explicit.

YARS (Yet Another RDF Store) is a data store for RDF in Java and
allows for querying RDF based on a declarative query language,
which offers a somewhat higher abstraction layer than the APIs of
RDF toolkits such as Jena or Redland

Zemanta makes suggestions to authors about links and pictures to
embed in blog posts based on semantic analysis. It works in the
Firefox browser and supports WordPress, Blogger and Typepad
blogging platforms. Currently in alpha.

Zeus provides a graphical environment to build distributed agent
systems. A rule engine, planner and visualisation tools are
included. The released version contains some extensions for the
DAML semantic web project and Web Services integration features

ZGRViewer is a graph visualizer implemented in Java and based upon
the Zoomable Visual Transformation Machine. It is specifically
aimed at displaying graphs expressed using the DOT language from
AT&T GraphViz and processed by programs dot, neato or others
such as twopi. ZGRViewer is designed to handle large graphs, and
offers a zoomable user interface (ZUI), which enables smooth
zooming and easy navigation in the visualized structure.

The Zitgist RDF browser is an online service for querying multiple
RDF datasets with a variety of views and faceted type restriction
selections. The front-end on the server is in PHP with the backend
powered by OpenLink's Virtuoso server

zLinks is a WordPress plug-in that shows blog readers existing
related data and interlinked URIs for a given URL and, in some
case, to enable them to perform actions based on this data. Icons
denote the type of action, which may also occur via the Zitgist RDF
Browser.

ZOOMA is an application for discovering optimal ontology mappings,
designed to automatically map text values to ontology terms. ZOOMA
performs text to ontology term mappings by operating over
"repositories of mappings".

ZoomInfo provides structured people or company search using a
unique summarization search engine that finds, understands and
saves useful information. It is an online service and a purchasee
enterprise service

SWRL-IQ is a plugin for Protege 3.x that allows users to edit,
save, and submit queries to an underlying inference engine based on
XSB Prolog. The tool features backward- and forward-chaining
reasoning; constraint-based solving; debugging; and saving and
exports. It also supports SWRL extensions and attached Java
procedures

VisKo is a framework supporting the execution of declarative
visualization queries. The premise of this work is that
visualization users and developers can more easily specify for the
generation of visualizations by writing queries rather than
programming custom visualization pipelines . VisKo is composed of
the four components: ontologies; its query language; its knowledge
base; and reasoning

Rex is a component of the Revelytix Emergent Analytics Software
Suite. It will execute any rules written using a Rules Interchange
Format (RIF) dialect (currently we support the BLD dialect, plus
the NOT operator)

The Stanbol Ontology Manager provides a controlled environment for
managing ontologies, ontology networks and user sessions for
semantic data modeled after them. It provides full access to
ontologies stored into the Stanbol persistence layer. Managing an
ontology network means that you can activate or deactivate parts of
a complex model from time to time, so that your data can be viewed
and classified under different "logical lenses". This is done via
the OntoNet component of the manager

Wrangler is an interactive tool for data cleaning and
transformation. It allows transformation of messy, real-world data
into structured data tables. It has many export options, including
Excel, R, Tableau and Protovis

Doozer is an on-demand domain hierarchy creation application that
is available as a Web service. It extracts a domain model from
Wikipedia using its category hierarchy, link graph and concept
descriptors. Inputs are a seed concept and optional broader domain
concept