A potential, severe security problem in utilization of Unicode in the Web is identified, which derives from the fact that there are too many similar characters in the Unicode Character Set (UCS). The foundation of solving such a problem relies on the solution of evaluating the similarity of characters in UCS. We propose using a renowned Kernel Density Estimation (KDE) solution to establish such a Unicode Similarity List (UC-SimList).