DOAJ Open Access 2007

On Correlation Polynomials and Subword Complexity

Irina Gheorghiciuc Mark Daniel Ward

Abstrak

We consider words with letters from a $q-ary$ alphabet $\mathcal{A}$. The kth subword complexity of a word $w ∈\mathcal{A}^*$ is the number of distinct subwords of length $k$ that appear as contiguous subwords of $w$. We analyze subword complexity from both combinatorial and probabilistic viewpoints. Our first main result is a precise analysis of the expected $kth$ subword complexity of a randomly-chosen word $w ∈\mathcal{A}^n$. Our other main result describes, for $w ∈\mathcal{A}^*$, the degree to which one understands the set of all subwords of $w$, provided that one knows only the set of all subwords of some particular length $k$. Our methods rely upon a precise characterization of overlaps between words of length $k$. We use three kinds of correlation polynomials of words of length $k$: unweighted correlation polynomials; correlation polynomials associated to a Bernoulli source; and generalized multivariate correlation polynomials. We survey previously-known results about such polynomials, and we also present some new results concerning correlation polynomials.

Topik & Kata Kunci

Penulis (2)

I

Irina Gheorghiciuc

M

Mark Daniel Ward

Format Sitasi

Gheorghiciuc, I., Ward, M.D. (2007). On Correlation Polynomials and Subword Complexity. https://doi.org/10.46298/dmtcs.3553

Akses Cepat

Lihat di Sumber doi.org/10.46298/dmtcs.3553
Informasi Jurnal
Tahun Terbit
2007
Sumber Database
DOAJ
DOI
10.46298/dmtcs.3553
Akses
Open Access ✓