Semantic Scholar Open Access 2008 11471 sitasi

Latent semantic analysis

T. Landauer S. Dumais

Abstrak

A new method for automatic indexing and retrieval is described. The approach is to take advantage of implicit higher-order structure in the association of terms with documents ("semantic structure") in order to improve the detection of relevant documents on the basis of terms found in queries. The particular technique used is singular-value decomposition, in which a large term by document matrix is decomposed into a set of ca 100 orthogonal factors from which the original matrix can be approximated by linear combination. Documents are represented by ca 100 item vectors of factor weights. Queries are represented as pseudo-document vectors formed from weighted combinations of terms, and documents with supra-threshold cosine values are returned. Initial tests find this completely automatic method for retrieval to be promising.

Topik & Kata Kunci

Penulis (2)

T

T. Landauer

S

S. Dumais

Format Sitasi

Landauer, T., Dumais, S. (2008). Latent semantic analysis. https://doi.org/10.4249/scholarpedia.4356

Akses Cepat

Lihat di Sumber doi.org/10.4249/scholarpedia.4356
Informasi Jurnal
Tahun Terbit
2008
Bahasa
en
Total Sitasi
11471×
Sumber Database
Semantic Scholar
DOI
10.4249/scholarpedia.4356
Akses
Open Access ✓