Japanese Vowels dataset

Dataset information

The original Japanese Vowels (Vowels) dataset from the UCI Machine Learning Repository is a multivariate time series dataset in which nine male speakers uttered the two Japanese vowels /ae/ successively. Each utterance by a speaker forms a time series whose length is in the range 7-29, and each point of a time series has 12 features (12 coefficients). The original task is classification: identify the speaker. For outlier detection, each frame in the training data is treated as an individual data point, whereas the UCI repository treats a block of frames (an utterance) as an individual point. In this version, class (speaker) 1 is downsampled to 50 outliers. The inliers consist of classes 6, 7 and 8. The other classes are discarded.
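
The construction described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the procedure actually used to produce the dataset: the function name `build_outlier_dataset`, its parameters, and the random seed are hypothetical, and the utterances below are synthetic random arrays standing in for the real UCI data. Only the shapes (12 features, utterance lengths 7-29, nine speakers) and the class roles (speaker 1 as outliers, speakers 6, 7 and 8 as inliers) come from the description.

```python
import numpy as np


def build_outlier_dataset(frames, speakers, outlier_class=1,
                          inlier_classes=(6, 7, 8), n_outliers=50, seed=0):
    """Hypothetical helper: turn per-frame speaker data into an outlier set.

    frames   : array of shape (n_frames, 12), one row per frame
    speakers : array of shape (n_frames,), speaker label of each frame
    Returns X (selected frames) and y (0 = inlier, 1 = outlier).
    """
    rng = np.random.default_rng(seed)
    frames = np.asarray(frames)
    speakers = np.asarray(speakers)

    # Downsample the outlier class to n_outliers frames, without replacement.
    outlier_idx = np.flatnonzero(speakers == outlier_class)
    outlier_idx = rng.choice(outlier_idx, size=n_outliers, replace=False)

    # Keep every frame of the inlier classes; all other classes are dropped.
    inlier_idx = np.flatnonzero(np.isin(speakers, inlier_classes))

    idx = np.concatenate([inlier_idx, outlier_idx])
    y = np.concatenate([np.zeros(len(inlier_idx), dtype=int),
                        np.ones(len(outlier_idx), dtype=int)])
    return frames[idx], y


# Synthetic stand-in for the real data (assumption: 30 utterances per speaker).
demo_rng = np.random.default_rng(42)
utterances, utterance_speakers = [], []
for speaker in range(1, 10):                  # nine speakers, labelled 1..9
    for _ in range(30):
        length = demo_rng.integers(7, 30)     # utterance length in 7..29
        utterances.append(demo_rng.normal(size=(length, 12)))
        utterance_speakers.append(speaker)

# Treat each frame as an individual data point, not each utterance.
frames = np.vstack(utterances)
speakers = np.repeat(utterance_speakers, [len(u) for u in utterances])

X, y = build_outlier_dataset(frames, speakers)
print(X.shape, int(y.sum()))
```

Flattening with `np.vstack` is what changes the unit of analysis from utterance to frame. In this sketch, `rng.choice` with `replace=False` raises an error if the outlier class has fewer than `n_outliers` frames.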