arXiv Open Access 2012

Design, implementation and experiment of a YeSQL Web Crawler

Pierre Joulin Romain Deveaud Eric SanJuan-Ibekwe Jean-Marc Francony Françoise Para
Lihat Sumber

Abstrak

We describe a novel, "focusable", scalable, distributed web crawler based on GNU/Linux and PostgreSQL that we designed to be easily extendible and which we have released under a GNU public licence. We also report a first use case related to an analysis of Twitter's streams about the french 2012 presidential elections and the URL's it contains.

Topik & Kata Kunci

Penulis (5)

P

Pierre Joulin

R

Romain Deveaud

E

Eric SanJuan-Ibekwe

J

Jean-Marc Francony

F

Françoise Para

Format Sitasi

Joulin, P., Deveaud, R., SanJuan-Ibekwe, E., Francony, J., Para, F. (2012). Design, implementation and experiment of a YeSQL Web Crawler. https://arxiv.org/abs/1212.5633

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2012
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓