Dear RDWG,
Daniel PÃ¶ll has done some excellent research for us on how to best get
our symposium papers better indexed in scientific search engines.
Below is a summary of the main findings, please let us know if you have
any further thoughts or comments on these findings:
#1. Search Engines
It seems that Google Scholar and Microsoft Academics are the largest
search engine crawlers. There are several others though most seem to be
focused on particular domains and others need to be manually pointed to
the papers in order to index them. A list of search engines in here:
-
<http://en.wikipedia.org/wiki/List_of_academic_databases_and_search_engines>
#2. Metadata Formats
Apparently Dublin Core is not as widely supported for this use as we'd
initially thought. Some of the more widely supported metadata formats
seem to be:
- Highwire Press Tags
- Eprints Tags
- BE Press Tags
- PRISM Tags
Of these Highwire seems to be more widely used and documented. It also
seems that both Google Scholar and Microsoft Academics support it.
#3. Paper Requirements
The guidelines for Google Scholar (which also seem to be supported by
Microsoft Academics) do not have a strong impact on our current paper
structure. It seems we only need to add some <meta> elements to the HTML
code to reflect at least the:
- Title of the document
- Year of publishing
- At least one of the AuthorÂ´s names
Some useful resources found include:
- <http://scholar.google.com/intl/en/scholar/inclusion.html>
-
<http://www.monperrus.net/martin/accurate+bibliographic+metadata+and+google+scholar>
#4. Symposium Papers
We suggest the following <meta> elements to be added to the current HTML
for the symposium papers, to get them better indexed:
- <meta name="citation_title" content="[paper title]" />
- <meta name="citation_author" content="[author, multiple allowed]" />
- <meta name="citation_publication_date" content="[symposium date]" />
- <meta name="citation_online_date" content="[paper online date]" />
- <meta name="citation_conference_title" content="[symposium name]" />
- <meta name="citation_journal_title" content="W3C WAI Research and
Development Working Group (RDWG) Notes" />
- <meta name="citation_technical_report_institution" content="W3C Web
Accessibility Initiative (WAI)" />
Regards,
Shadi
--
Shadi Abou-Zahra - http://www.w3.org/People/shadi/
Activity Lead, W3C/WAI International Program Office
Evaluation and Repair Tools Working Group (ERT WG)
Research and Development Working Group (RDWG)