J-GLOBAL MeSH Dictionary

Data detail

A user dictionary for morphological analysis engine MeCab(http://taku910.github.io/mecab/) from J-GLOBAL science and technology terms that have linked to MeSH (Medical Subect Headings: https://www.nlm.nih.gov/mesh/) terms by United States National Library of Medicine. The dictionary items are based on the IPA dictionary and encoded in UTF-8.

The cost for the likelihood of the word to appear in a sentence (smaller, more likely)

POS

Part of speech

POS subcategory 1

POS subcategory 1

POS subcategory 2

POS subcategory 2

POS subcategory 3

POS subcategory 3

Conjugation type

Conjugation type

Conjugation form

Conjugation form

Base form

Same as the surface form

Reading('Furigana')

(empty)

Pronunciation

(empty)

Source dictionary

It is fixed as 'MeSH'.

ID in Source dictionary

MeSH UID

J-GLOBAL ID

ID in J-GLOBAL

Headword Flag

It is fixed as 'C'.

Category code

Category code of science fields in JST Thesaurus

Common word flag 1

・1: There is an entry (or entries) for the surface form in IPA dictionary・0: There are no entries for the surface in IPA dictionary

Common word flag 2

Based on "IPA dictionary analysis results":・When the value of Common word flag １ is 1, the value of this flag is the part of speech for the IPA dictionary analysis result.・When the value of Common word flag １ is 0:- UNKNOWN_1: if the result is one unknown word- UNKNOWN_2: if the result is multiple tokens including unknown word- MULTI_WORD: if the result is multiple tokens in IPA dictionary

IPA dictionary analysis results

Results of the morphological analysis with the original IPA dictionary (and the dictionary with IPA dictionary entries where zenkaku alphanumeric characters and symbols are converted into corresponding hankaku characters). If the result is devided into multiple tokens, it is whitespace-separated. It is not manually corrected.