Semantic Scholar · Open Access · 1998 · 1924 citations

Automatic Retrieval and Clustering of Similar Words

Dekang Lin

Abstract

Bootstrapping semantics from text is one of the greatest challenges in natural language learning. We first define a word similarity measure based on the distributional pattern of words. The similarity measure allows us to construct a thesaurus using a parsed corpus. We then present a new evaluation methodology for the automatically constructed thesaurus. The evaluation results show that the thesaurus is significantly closer to WordNet than Roget Thesaurus is.
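
To make the idea of a similarity measure "based on the distributional pattern of words" concrete, here is a minimal sketch. It assumes a parsed corpus has already been reduced to (head, relation, dependent) triples, and it uses plain cosine similarity over feature counts as a stand-in; the abstract does not specify the paper's actual measure, so this should not be read as the author's formula. The toy triples and the names `TRIPLES`, `build_profiles`, `cosine`, and `most_similar` are hypothetical.

```python
from collections import Counter, defaultdict
from math import sqrt

# Hypothetical toy triples (head word, relation, dependent word).
# These are illustrative only and do not come from the paper.
TRIPLES = [
    ("drink", "obj", "wine"),
    ("drink", "obj", "beer"),
    ("pour", "obj", "wine"),
    ("pour", "obj", "beer"),
    ("red", "mod", "wine"),
    ("cold", "mod", "beer"),
    ("drive", "obj", "car"),
    ("red", "mod", "car"),
]


def build_profiles(triples):
    """Map each dependent word to a Counter of its (relation, head) features."""
    profiles = defaultdict(Counter)
    for head, rel, dep in triples:
        profiles[dep][(rel, head)] += 1
    return profiles


def cosine(a, b):
    """Cosine similarity between two sparse count vectors (assumed stand-in measure)."""
    dot = sum(a[f] * b[f] for f in a.keys() & b.keys())
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def most_similar(word, profiles):
    """Rank every other word by similarity to `word`, most similar first."""
    target = profiles[word]
    scores = [(other, cosine(target, vec))
              for other, vec in profiles.items() if other != word]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    profiles = build_profiles(TRIPLES)
    for other, score in most_similar("wine", profiles):
        print(f"{other}\t{score:.3f}")
```

Ranking every word's neighbours this way yields one thesaurus-style entry per word. In this toy data, "wine" and "beer" share two of their three contexts, so "beer" ranks above "car".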

Authors (1)

Dekang Lin

Citation Format

Lin, D. (1998). Automatic Retrieval and Clustering of Similar Words. https://doi.org/10.3115/980691.980696

Quick Access

View at source: doi.org/10.3115/980691.980696

Journal Information

Publication Year: 1998
Language: en
Total Citations: 1924
Source Database: Semantic Scholar
DOI: 10.3115/980691.980696
Access: Open Access