Computer Science as Infrastructure: the Spine of the Lean Computer Science Library (CSLib)
Christopher Henson, Fabrizio Montesi
Following in the footsteps of the success of Mathlib - the centralised library of formalised mathematics in Lean - CSLib is a rapidly-growing centralised library of formalised computer science and software. In this paper, we present its founding technical principles, operation, abstractions, and semantic framework. We contribute reusable semantic interfaces (reduction and labelled transition systems), proof automation, CI/testing support for maintaining automation and compatibility with Mathlib, and the first substantial developments of languages and models.
Responsible Intelligence in Practice: A Fairness Audit of Open Large Language Models for Library Reference Services
Haining Wang, Jason Clark, Angelica Peña
As libraries explore large language models (LLMs) as a scalable layer for reference services, a core fairness question follows: can LLM-based services support all patrons fairly, regardless of demographic identity? While LLMs offer great potential for broadening access to information assistance, they may also reproduce societal biases embedded in their training data, potentially undermining libraries' commitments to impartial service. In this chapter, we apply a systematic evaluation approach that combines diagnostic classification to detect systematic differences with linguistic analysis to interpret their sources. Across three widely used open models (Llama-3.1 8B, Gemma-2 9B, and Ministral 8B), we find no compelling evidence of systematic differentiation by race/ethnicity, and only minor evidence of sex-linked differentiation in one model. We discuss implications for responsible AI adoption in libraries and the importance of ongoing monitoring in aligning LLM-based services with core professional values.
The impact of digitisation on the conceptual system of library science
O. Nalyvaiko
The purpose of the article is to study the manifestations of the impact of digitalisation on the development of the conceptual system of librarianship. Research methodology. Methods of comparison, logical generalisations, content analysis of scientific publications in domestic and foreign scientific journals, encyclopaedias, dictionaries, and regulatory documents were used to determine the semantic structure of the terms electronic document, electronic collection, electronic library fund, electronic library, and to establish the relationship between the terms electronic and digital. The scientific novelty of the work lies in clarifying and refining the relationship between the semantics of library terms during the transition from the traditional concept of preserving and organising collections to the creation of electronic repositories with structured content and functional links between sources. Conclusions. Domestic and international terminology standards, encyclopaedic and other terminological sources indicate differences in the definitions of terms containing the terms electronic and digital. Electronic resources cover a wider range of electronic formats, including digital documents, as well as multimedia and interactive objects. A digital document exists exclusively in digital format or has been digitised from analogue media. The term digital library is more accurate than the term electronic library, as it more correctly reflects the methods of presenting information. The process of forming electronic library collections is unfolding in two directions: the creation of electronic documents without printed counterparts and the digitisation of traditional publications. It has been proven that the electronic library collection, as a subset of the library collection, covers all electronic resources of the library, while the electronic collection is a subset of the collection formed according to certain criteria. 
An electronic document is interpreted as a separate information object that requires ensuring the integrity of meta-information, format compatibility, and the implementation of new semantic models of bibliographic descriptions.
Guarding against artificial intelligence – hallucinated citations: The case for full-text reference deposit.
Alex Glynn
The tendency of generative artificial intelligence (AI) to ‘hallucinate’ false information is well known; AI-generated citations to non-existent sources have penetrated the bibliographies of peer-reviewed publications. Drawing from the Transparency and Openness Promotion guidelines, American judicial contention with generative AI, and the submission of prior art to the US Patent and Trademark Office, the author proposes that journals require authors to submit the full text of each cited source along with their manuscripts, thereby preventing authors from citing material whose full text they cannot produce. This solution requires limited additional work by authors or editors while effectively immunizing journals against hallucinated references.
Academies and learned societies, Bibliography. Library science. Information resources
Exploring the potential of large language models and generative artificial intelligence (GPT): Applications in Library and Information Science
Matús Formánek
The presented study offers a systematic overview of the potential application of large language models (LLMs) and generative artificial intelligence tools, notably the GPT model and the ChatGPT interface, within the realm of library and information science (LIS). The paper supplements and extends the outcomes of a comprehensive information survey on the subject matter with the author’s own experiences and illustrative examples of possible applications. This study does not involve testing available LLMs or selecting the most suitable tool; instead, it targets information professionals, specialists, librarians, and scientists, aiming to inspire them in various ways. Within this paper, we explore both well-known and less recognized use cases of generative AI tools, which may prove relevant not only for the target group of information specialists but also for other users. Our analysis demonstrates that apart from merely summarizing or expanding existing textual content, these AI tools hold the potential for performing non-standard yet sophisticated tasks with electronic information resources. They can facilitate interactive engagement with these resources, aid in the extraction and composition of descriptive metadata, indexing, and even possible classification. Nevertheless, it is essential to acknowledge the numerous limitations of current LLMs, which we also discuss in this study.
19 citations
en
Computer Science
Media Reports as a Source of Information for Creating Advisory Bibliographic Resources for the Development of Scientific Worldview
A. Yuklyaevskaya
Disseminating and updating advisory bibliographic resources on scientific knowledge and promoting a deep public understanding of scientific problems are becoming some of the library's most important tasks. The purpose of the article is to show the possibility of using mass media reports as one of the sources for creating such bibliographic resources. The author presents the results of monitoring 146 Russian media outlets (69 federal, 77 regional) using the Medialogy database, conducted in March–April 2023. The analyzed media reports made it possible to identify the objects (organizations, persons, project names) involved in communication between science and society. As a result of the study, a list of 334 reports covering the most notable news events was compiled. These reports can serve as objects of bibliographic description, as well as factual information for creating advisory bibliographic resources (lists, reviews, newsletters, digests, databases, guidebooks) aimed at increasing public awareness of the development of science and spreading scientific literacy.
An Integrated Web-Based Platform for Student Library and Information Sciences Education: Architecture and Implementation
Pavel Goranov
This paper presents an integrated web-based learning platform for library and information science students, focusing on its architecture and implementation. The platform provides capabilities for creating bibliographic records, bibliographic references, and lists, based on the MARC 21 standard for bibliographic description. Students can view bibliographic records in ISBD format, ensuring completeness and accuracy of information resources. The developed system provides an intuitive user interface and complies with modern digital learning requirements, facilitating the professional development of future librarians.
Model of Publishing Activities of Scientific Libraries
Zlyhostiev Serhiy
The article highlights the peculiarities of implementing a publishing model. The aspects that indicate the importance of creating such a model are revealed. The publishing model makes it possible to take into account all the factors that may affect this process. The issue of the publishing activity model is scarcely covered in the scientific literature, yet it is of considerable scientific interest. To effectively organize any activity, you need to set aside time and form a model of such activity. In the course of our study, we investigated the peculiarities of the publishing activities of libraries and identified aspects that can be the basis for a model of publishing activities. An optimal model of publishing activity is proposed. The model is based on the analysis of scientific papers and of the publishing activities of national libraries and libraries of higher education institutions. By the model of publishing activity, we mean the strategy by which the preparation, production and distribution of publishing products are carried out. It is worth noting that the developed model is a valuable tool for assessing the current state of library publishing activities, identifying priority areas for development, and developing a strategy that will help increase the efficiency and profitability of these activities. In creating the model, we tried to take into account all the important aspects of library publishing activities and aspects that can improve these activities. We do not exclude the possibility that, after a deeper analysis of the publishing activities of libraries in certain categories, changes may be made to the model to adapt it to a particular library.
Bibliography. Library science. Information resources
Activities of Russian Libraries in the Field of Retrospective Scientific and Auxiliary Bibliography: Issues of Theory and Modern Practice
G. L. Levin
Since the late 1920s, libraries of various types and kinds have been the main centres for preparing domestic bibliographic products, among which a special place is held by summarizing retrospective indexes of literature, which require significant research work. Scientific and auxiliary bibliography acts as an organic part of scientific and information activity. Moreover, it represents an integral component of science as a social system. The article provides an overview of fundamental retrospective bibliographic resources published by federal and regional libraries of Russia in 2000-2023.
Information Support of Innovation Activities by Libraries: the Experience of Library for Natural Sciences of RAS
I. Mitroshin
The article discusses the experience of the patent sector of the Library for Natural Sciences of the Russian Academy of Sciences in supporting the research and innovation activities of the organizations the library serves. The main activities of the sector are presented: information services, consulting, and methodological and analytical support for users. The sector works with the patent and other structural divisions of research institutes to expand the range of services, improve the quality of service, and improve its own information systems and resources. Directions for further research and development are determined, the effectiveness of the created results of intellectual activity (RIA) is evaluated, and the possibility of patenting inventions and utility models is assessed, based on an analysis of documentation (both patent data and bibliographic data). The data from ongoing research are useful for the management of the research institute and for its scientific and engineering staff. The possibility of scientific and scientific-technical libraries participating in patent research and in building patent landscapes is considered. Conclusions are drawn about the prospects of using these areas in the activities of libraries. Working with intellectual property is becoming an important and promising direction in the work of scientific libraries and helps to attract new users. This line of work strengthens the reputation of libraries as a reliable source of information, strengthens links with the scientific and innovation communities, and highlights new research and development. The need for interaction with other departments of the library is noted. In particular, the work of the patent sector is connected with the information and bibliographic department. 
We use modern approaches to research based on new information technologies and indicators of bibliometric statistics, as well as methods of comparative analysis of publication activity, citation analysis, etc. This line of activity helps us to stay relevant and competitive in a rapidly changing world of information and knowledge. In the future, this direction can bring significant benefits for various industries, science and technology, including the promotion of innovation and economic growth in general.
Publish for Public: Improving Access of Public Libraries Users to Research Findings through Plain Language Summaries
Behrooz Rasuli
Public libraries play a crucial role in disseminating knowledge to society. However, most of their users do not have the specialized knowledge needed to understand new research findings. Providing plain language summaries (PLSs) in public libraries is a way to make new research findings more accessible and understandable for the public. This article proposes a framework for providing PLSs as a new service in public libraries. Drawing from the literature on science and society, PLSs, and public libraries, a theoretical framework is developed. The findings suggest that public libraries can collect PLSs through different methods, such as professional teams, researchers, crowdsourcing, etc. Library newsletters, special publications, brochures, independent online databases, and social networks are among the most effective channels for making PLSs accessible to users. By proposing a framework for providing PLSs in public libraries, this study helps to bridge the gap between scientific research and the public.
A bibliometric study on library and information science and information systems literature during 2010-2019
Manjula Wijewickrema
Purpose. Studying the nature of research progress in interrelated research domains is important for evaluating the research productivity and to understand the current trends of the area of research. This study aims to examine a research domain that combines library and information science with information systems (IS). Design/methodology/approach. Quartile 1 journals that cover both subject domains in SCImago were selected for the study. Bibliographic records of the publications during 2010 and 2019 were retrieved from the Scopus database. VOSviewer data visualization tool was used to perform citation, coauthorship, bibliographic coupling, cocitation and co-occurrence analysis. In addition, descriptive and inferential statistics were exploited. Findings. The absence of a consistent association between the number of documents authored and the number of citations received by a researcher was an important finding of the study. The strong association of authors regardless of the different topics they researched and the trend of increasing interest on collaborative research were also highlighted. Moreover, the authors who received the highest number of citations were not always the first authors of the documents which received the most citations. The documents published in information management, information theory and IS journals attained the most citations. The journals, institutions and countries with the highest number of documents and citations were also revealed by the research. Electrical engineering departments showed a higher research productivity, while they were utilized more compared to that of other departments. 
IS and management, information theory, communication, information retrieval, geographic-based IS and bioinformatics were the six major research areas of the considered domain. Originality/value. This is the first study related to examining the research progress in a combined subject domain using multiple aspects including, individual performance, institutional progress, geographical contribution and so on. Identifying the major research areas in the combined subject domain can also be considered a novel contribution to the field.
29 citations
en
Computer Science
Preservation Practices for Information Resources in Public University Libraries in Tanzania
Christian Mubofu, H. Mambo, A. Samzugi
The present study examined the preservation practices for information resources in seven public university libraries in Tanzania. Convenience and purposive sampling techniques were used to recruit library staff and library directors respectively to participate in the study. Data were collected through questionnaires and semi-structured interviews. The quantitative data obtained from the questionnaires were analysed using the Statistical Product and Service Solutions (SPSS, version 20) software, while the qualitative data from the interviews were analysed using the thematic data analysis method. Regarding the effective preservation approaches for information resources, the study established that binding, digitization, migration, and emergency preparedness plans were highly cited. The following were the recommended preservation approaches for adoption in public university libraries: digitization, refreshing, proper shelving, cleanliness of all library areas, migration, technology preservation, and binding. From the findings, it was concluded that public university libraries have a variety of preservation approaches for both print and digital information resources. The study recommends that policymakers and the Ministry of Education, Science, and Technology should train librarians in skills that they can apply in preserving information resources using modern technologies.
Scientific Bibliography in the State Public Scientific Technological Library of the Siberian Branch of the Russian Academy of Sciences: Stages of Development, Modern Realities and Prospects
T. Busygina, L. A. Mandrinina, V. Rykova et al.
The article aims to show how bibliographic activity in support of research has developed at the State Public Scientific Technological Library of the Siberian Branch of the Russian Academy of Sciences (SPSTL SB RAS): namely, to trace the changes that computers and software brought to the technology of creating the library's own information resources; to present the types of products created by its staff during 1958–2021; and to highlight the topics of traditional current and retrospective bibliographic indexes and of bibliographic and full-text databases. The paper presents quantitative characteristics of the electronic resources generated; describes the forms of bibliographic services (selective dissemination of information, differentiated services for managers, and electronic thematic exhibitions of literature based on the databases); and gives a brief description of bibliometric research. The authors identify prospects for developing the SPSTL SB RAS information and bibliographic complex: expanding the authority files in bibliographic databases; broadening the range of Open Access sources used to select information for regional bibliographic resources; developing interactive forms of information services for scientists and specialists; participating in the semantic web through linked data technology based on authority files as an object (predicate); and improving the service capabilities of the information search system (access to full texts).
A Relationships-based Algorithm for Detecting the Communities in Social Networks
Sevda Fotovvat, Habib Izadkhah, Javad Hajipour
Social network research analyzes the relationships among people, organizations, and other entities, and their interactions. With the growing popularity of social media, community detection is drawing the attention of researchers. The purpose of community detection is to divide social networks into groups. These communities are made up of entities that are very closely related. Communities are defined as groups of nodes (vertices) whose relationships with one another are stronger than their relationships with nodes outside the group. Clustering social networks is important for revealing their basic structure and for discovering how these systems are linked to people and their interactions. Social networks can be represented by graphs in which users are the nodes and the relationships between users are the edges. Communities are detected through clustering algorithms. In this paper, we propose a new clustering algorithm that takes into account the strength of relationships among people. Results on specific datasets suggest that taking into account the depth of person-to-person relationships increases the accuracy of clustering methods.
Information resources (General)
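The graph representation described in the abstract above (users as nodes, relationships as edges, with the strength of each relationship taken into account) can be illustrated with a minimal sketch. This is not the authors' algorithm, which the abstract does not specify. It is a deliberately simple stand-in that keeps only edges at or above a strength threshold and treats the connected components that remain as communities. The function name `strong_tie_communities`, the `min_strength` parameter, and the sample data are all hypothetical.

```python
from collections import defaultdict


def strong_tie_communities(edges, min_strength):
    """Group users by strong ties (illustrative stand-in, not the paper's method).

    edges: iterable of (user_a, user_b, strength) tuples.
    Edges weaker than min_strength are ignored; each connected component
    of the remaining graph is returned as one community (sorted list).
    """
    adjacency = defaultdict(set)
    nodes = set()
    for u, v, strength in edges:
        nodes.update((u, v))
        if strength >= min_strength:
            adjacency[u].add(v)
            adjacency[v].add(u)

    seen = set()
    communities = []
    for start in sorted(nodes):
        if start in seen:
            continue
        # Depth-first search that follows strong ties only.
        stack, component = [start], set()
        while stack:
            node = stack.pop()
            if node in seen:
                continue
            seen.add(node)
            component.add(node)
            stack.extend(adjacency[node] - seen)
        communities.append(sorted(component))
    return communities


# Hypothetical data: two tight groups joined by one weak tie (cat-dan).
edges = [
    ("ann", "bob", 5), ("bob", "cat", 4), ("ann", "cat", 3),
    ("cat", "dan", 1),
    ("dan", "eve", 4), ("eve", "fay", 5),
]
communities = strong_tie_communities(edges, min_strength=3)
print(communities)
```

With the threshold set to 3, the weak tie between the two groups is ignored and two communities are returned. Lowering the threshold to 1 merges everything into a single community, which shows how much the result depends on how relationship strength is used.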
Information measures and geometry of the hyperbolic exponential families of Poincaré and hyperboloid distributions
Frank Nielsen, Kazuki Okamura
We study various information-theoretic measures and the information geometry of the Poincaré distributions and the related hyperboloid distributions, and prove that their statistical mixture models are universal density estimators of smooth densities in hyperbolic spaces. The Poincaré and the hyperboloid distributions are two types of hyperbolic probability distributions defined using different models of hyperbolic geometry. Namely, the Poincaré distributions form a triparametric bivariate exponential family whose sample space is the hyperbolic Poincaré upper-half plane and whose natural parameter space is the open 3D convex cone of two-by-two positive-definite matrices. The family of hyperboloid distributions forms another exponential family whose sample space is the forward sheet of the two-sheeted unit hyperboloid modeling hyperbolic geometry. In the first part, we prove that all $f$-divergences between Poincaré distributions can be expressed using three canonical terms using Eaton's framework of maximal group invariance. We also show that the $f$-divergences between any two Poincaré distributions are asymmetric except when those distributions belong to the same leaf of a particular foliation of the parameter space. We report closed-form formulas for the Fisher information matrix, the Shannon differential entropy, the Kullback-Leibler divergence, and the Bhattacharyya distances between such distributions using the framework of exponential families. In the second part, we state the corresponding results for the exponential family of hyperboloid distributions by highlighting a parameter correspondence between the Poincaré and the hyperboloid distributions. Finally, we describe a random generator to draw variates and present two Monte Carlo methods to stochastically estimate $f$-divergences between hyperbolic distributions numerically.
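For readers unfamiliar with the quantities named in the abstract above, the standard definition of an $f$-divergence and its plain Monte Carlo estimator can be written as follows. This is textbook background, not a reproduction of the paper's two estimation methods, which the abstract does not specify. The symbols $p$, $q$, $\mu$, $n$ and $\hat{I}_f$ are notation assumed here for illustration.

```latex
% Assumptions: f is convex with f(1) = 0; p and q are densities
% with respect to a common base measure \mu on the sample space.
\begin{align*}
  I_f(p : q) &= \int p(x)\, f\!\left(\frac{q(x)}{p(x)}\right) \mathrm{d}\mu(x), \\
  % The choice f(u) = -\log u recovers the Kullback-Leibler divergence.
  \mathrm{KL}(p : q) &= \int p(x)\, \log\frac{p(x)}{q(x)}\, \mathrm{d}\mu(x), \\
  % Plain Monte Carlo estimate from n independent draws x_1, ..., x_n from p.
  \hat{I}_f &= \frac{1}{n} \sum_{i=1}^{n} f\!\left(\frac{q(x_i)}{p(x_i)}\right),
  \qquad x_1, \dots, x_n \overset{\text{iid}}{\sim} p .
\end{align*}
```

Because the estimator only needs samples from $p$ and pointwise density evaluations, it is the reason a random variate generator, as mentioned in the abstract, is a prerequisite for this kind of stochastic estimation.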
galstreams: A Library of Milky Way Stellar Stream Footprints and Tracks
Cecilia Mateu
Nearly a hundred stellar streams have been found to date around the Milky Way and the number keeps growing at an ever faster pace. Here we present the galstreams library, a compendium of angular position, distance, proper motion and radial velocity track data for nearly a hundred (95) Galactic stellar streams. The information published in the literature has been collated and homogenised in a consistent format and used to provide a set of features uniformly computed throughout the library: e.g. stream length, end points, mean pole, stream's coordinate frame, polygon footprint, and pole and angular momentum tracks. We also use the information compiled to analyse the distribution of several observables across the library and to assess where the main deficiencies are found in the characterisation of individual stellar streams, as a resource for future follow-up efforts. The library is intended to facilitate keeping track of new discoveries and to encourage the use of automated methods to characterise and study the ensemble of known stellar streams by serving as a starting point. The library is publicly available as a Python package and served at the galstreams GitHub repository.
Open-access scientific publishing: decolonial challenges for Latin America [Publicación científica en acceso abierto: desafíos decoloniales para América Latina]
Nancy Sanchez Tarrago
Using the concepts of colonialities and of the geopolitics and body-politics of knowledge, this article reflects on some challenges facing Latin American open-access scientific publishing. Although Latin America is a pioneer in open-access initiatives and in creating cooperative regional systems for sharing knowledge as a common good, "international" journals endorsed by an impact factor continue to be prioritized in the science evaluation and funding systems of most countries in the region. In addition, strategies for commodifying open access are becoming increasingly pervasive and threaten to subvert some of the initial aims of the open access movement and to create deeper gaps between the North and the South. Behind these issues lies the naturalization and perpetuation of ontological and epistemic hierarchies and exclusions, underpinned by systemic racism, which decolonial authors characterize as colonialities. What is required is epistemic disobedience, as a decolonial attitude, and a concerted regional effort that transforms evaluation systems, preserves the public and academic character of science, and guarantees equity and social justice.
Bibliography. Library science. Information resources, Information resources (General)
Francesco Griffo in Padua. From Goldsmith to 'Grammatoglypta' (1470-1480)
Chiara Reatti, Paolo Tinti
The essay outlines the state of knowledge on Francesco Griffo from Bologna and presents new and valuable archival evidence found in the State Archive of Padua. The already known documents are combined with the significant set of recently discovered papers, in order to investigate the decade that Griffo spent in Padua, a city that was crucial for his life and professional career. There he came as a goldsmith, attracted by the artistic fervor that surrounded the Basilica of Saint Anthony, and there Griffo made the gradual transition to the engraving of printing types, working for some of the protagonists of early printing and establishing professional relationships that would prove decisive for his future, in Venice and elsewhere.
Bibliography. Library science. Information resources
Can Information Flows Suggest Targets for Interventions in Neural Circuits?
Praveen Venkatesh, Sanghamitra Dutta, Neil Mehta et al.
Motivated by neuroscientific and clinical applications, we empirically examine whether observational measures of information flow can suggest interventions. We do so by performing experiments on artificial neural networks in the context of fairness in machine learning, where the goal is to induce fairness in the system through interventions. Using our recently developed $M$-information flow framework, we measure the flow of information about the true label (responsible for accuracy, and hence desirable), and separately, the flow of information about a protected attribute (responsible for bias, and hence undesirable) on the edges of a trained neural network. We then compare the flow magnitudes against the effect of intervening on those edges by pruning. We show that pruning edges that carry larger information flows about the protected attribute reduces bias at the output to a greater extent. This demonstrates that $M$-information flow can meaningfully suggest targets for interventions, answering the title's question in the affirmative. We also evaluate bias-accuracy tradeoffs for different intervention strategies, to analyze how one might use estimates of desirable and undesirable information flows (here, accuracy and bias flows) to inform interventions that preserve the former while reducing the latter.