ABSTRACT: PGP is developing into one of the main public key infrastructures (PKI) in the Internet. This paper argues that Directory support of PGP infrastructure can help overcome some of the drawbacks of this PKI. It also states some general requirements for a storage model for PGP keys.

1. short update on the TERENA Directory Schema Registry project, 2. collecting interests in schema listing/registry, 3. discussion about what work should be done with the old and now resubmitted drafts, 4. discussion about if there is a need for schema-ng which may include subjects like XML-S and RDF-S. 5. aob

This article presents an approach to citation segmentation that addresses special challenges as typically found in Digital Humanities applications. We perform citation segmentation from Optical Character Recognition (OCR) input obtained from volumes of a printed bibliography, the Turkology Annual. This showcase application features serious difficulties for state-of-the-art techniques in citation segmentation: multilingual citation entries, lack of data redundancy, inconsistencies, and noise from OCR input. Our approach is based on Markov logic networks (MLN) (Richardson and Domingos, Markov logic networks. Machine Learning, 62(1): 107–36, 2006), a framework of statistical relational learning that combines first-order logic with probabilistic modeling. Formalization in first-order logic offers high expressivity and flexibility, and makes it possible to tailor segmentation to specific conventions of a given bibliography. We show that in face of the specific difficulties found with seg...

Proceedings of the Second IEEE International Conference on e-Science and Grid Computing E-SCIENCE ´06. IEEE Computer Society 2006. Amsterdam 2006

2007-01

article

TextGrid

TextGrid is a new Grid project in the framework of the German D-Grid initiative, with the aim to deploy Grid technologies for humanities scholars working on historical (German) texts. Its two roots, humanities computing and eScience (Grid computing used by research together with modern communication technologies), are the basis for TextGrid to provide pioneer work in eHumanities. After summarizing Humanities Computing and modern network technologies, community expectations in the fields of philological edition and other application areas are set forth, from which functional requirements such as modularity, distribution, etc. are distilled. The first version of the TextGrid architecture was designed in accordance with these requirements, and focuses on openness by standard conformance and encapsulation. It provides storage Grid services via a pure Web Services interface to dedicated Web Services tools for different aspects of text processing, analysis and retrieval. This platform aims to provide easily usable tools for scholars, but also specifies interfaces for external program developers to add functionality.

In Germany’s D-Grid project numerous Grid communities are working together to provide a common overarching Grid infrastructure. The major aims of D-Grid are the integration of existing Grid deployments and their interoperability. The challenge lies in the heterogeneity of the current implementations: three Grid middleware stacks and different Virtual Organization management approaches have to be embraced to achieve the intended goals. In this article we focus on the implementation of an attribute-based authorization infrastructure that not only leverages the well-known VO attributes but also campus attributes managed by a Shibboleth federation.

The IVOM project [1] requires in work package 2 the specification of both functional and nonfunctional requirements for the VO-management in D-Grid from a community perspective. These requirements will be the driving forces behind the conceptual work in AP3. This paper focuses on the specification of these requirements.

Investigation in the possibilities to reference between the X.521 naming and the Domain Componant naming

The aim of this deliverable is to investigate the possibilities to reference between the X.521 [X.521] naming and the Domain Component naming [RFC 2247] as well as the referral mechanisms to set up a Directory Information Tree between LDAP servers via distribution of knowledge information. The results of the DIRECT Project and the current respective IETF efforts are base for this item.

GeoTwain: Geospatial analysis and visualization for researchers of transculturality

In our research cluster on transculturality many projects deal with georeferenced data. This paper offers an introduction to the new project GeoTwain that works on visualisation techniques for such data. Based on an analysis of the global telegraph network as an example of transcultural research using geo-referenced data, the paper derives user requirements by combining experiences gained from previous projects and specifies a set of solutions. It is the aim of GeoTwain to provide easy visualization of 4-D-information based on Google Earth and to grasp spatial relationships embedded in historical evidence to analyse, recombine and disaggregate geo-referenced historical data without having to use more specialized and highly complex GIS Tools. Envisioned visualization with GeoTwain allows for fast and efficient assessment of georeferencing's analytical potential in any given case; it also allows the user to carefully weigh further investments in data enrichment in relation to expected findings. Both the development and application of GeoTwain are embedded in the broader research environment infrastructure called Heidelberg Research Architecture (HRA).

This article describes the life cycle of a TEI Document within TextGrid, an eHumanities platform for scholarly text processing, in which structured search is based on the TEI framework and metadata with restricted values. A workbench is provided that offers tools for handling TEI documents, TextGridLab, making it easier to annotate, process, search, and persistently store new digitized texts. The digitization and annotation of the Campe dictionary1 serves as a first test bed. The overall framework of TextGrid is very generic and can handle different types of text (literary editions, linguistic corpora, lexica) as well as heterogeneous data formats (plain text, XML/TEI, images). In fact, the TextGrid repository, TextGridRep, is designed as a digital virtual library over federated archives, where humanities projects are invited to participate. Sharing of data is enabled by means of a grid-based architecture. Specifically the middleware includes most of the treatment of authorization, search, and file management. TextGrid is entirely based on open source software including Eclipse2 and Globus Toolkit.

Access to the DARIAH Bit Preservation Service for Humanities Research Data

Sustainable management of large amounts of research data is gaining in importance for research projects all over the world. The European project DARIAH aims to address this topic for the arts and humanities community. The DARIAH Bit Preservation, as a part of an archiving system for the arts and humanities, allows for a high performance, sustainable, and distributed storage of research data as basis of virtual research environments. A great challenge in designing such a service is to provide a standardized, consistent yet easy-to-use API for accessing the data that remains stable even if backend technology changes over time. As a solution, this paper presents the RESTful API of the DARIAH Bit Preservation which includes an administrative extension, and which is secured by an Authentication and Authorization Infrastructure (AAI) based on SAML. An exemplary implementation illustrates that the API offers distributed access by usage of the HTTP protocol and is able to handle a high number of files. Data transfer rates of up to 45 MB/s were achieved for uploading large files in the local network.

This white-paper expresses common requirements of Research Communities seeking to leverage Identity Federation for Authentication and Authorisation. Recommendations are made to Stakeholders to guide the future evolution of Federated Identity Management in a direction that better satisfies research use cases. The authors represent research communities, Research Services, Infrastructures, Identity Federations and Interfederations, with a joint motivation to ease collaboration for distributed researchers. The content has been edited collaboratively by the Federated Identity Management for Research (FIM4R) Community, with input sought at conferences and meetings in Europe, Asia and North America.