- MySQL 5.1 Reference Manual :: ... :: 12.7.4 Full-Text Restrictions
Ideographic languages such as Chinese and Japanese do not have word delimiters. Therefore, the FULLTEXT parser cannot determine where words begin and end in these and other such languages. The implications of this and some workarounds for the problem are described in Section 12.7, “Full-Text Search Functions”.
- MySQL Bugs: #4158: MySQL Fulltext Index doesn't Work for asian languages
it's a known definiency - we don't have a correct algorithm of splitting Chinese
(or Japanese) text into words. The workaround is to put non-word chartacters
between words.
這兩天發現了 讓 MySQL4.0 FullText 全文檢索支持中文 這一個站,裡面提到用開發 MySQL UDF 或 plugin 的方式幫助中文分詞,似乎是個可行的方式。
0 意見:
張貼意見