We formally define the literature (reference) snowballing method and present a refined version of it. We show that the improved algorithm can substantially reduce curator work, even before application of text classification, by reducing the number of candidates to classify. We also present a desktop application named LitBall that implements this and other literature collection methods, through access to the Semantic Scholar academic graph (S2AG).
We have analyzed all preprints on ChatGPT (N=501) and selected the most influential preprints (according to Altmetric) about ChatGPT across scientific disciplines to provide the most discussed research results about ChatGPT. We prompted ChatGPT to create a structured review article based on them. The results are surprisingly promising, suggesting that the future of creating review articles can lie in ChatGPT.
This paper introduces reproducible research, and explains its importance, benefits and challenges. Some important tools for conducting reproducible research in Transportation Research are also introduced. Moreover, the source code for generating this paper has been designed in a way so that it can be used as a template for researchers to write their future journal papers as dynamic and reproducible documents.
The objective of this paper is to present a meta-corpus of diplomatic documents entitled Cartae Europae Medii Aevi or CEMA. It shows the logic and limits of this meta-corpus, which contains 250,000 documents, by specifying both its structure and its possible future extensions. The second part of the paper is devoted to specific examples that will attempt to show the interest of such a database. The third part examine the possibilities opened up by the corpus in terms of historical semantics.
Systematic reviews and meta-analyses of mediation studies are increasingly being implemented in practice. Nonetheless, the methodology for conducting such review and analysis is still in a development phase, with much room for improvement. In this paper, we highlight and discuss challenges that investigators face in mediation systematic reviews and meta-analyses, then propose ways of accommodating these in practice.
Success in academia hinges on publishing in top tier journals. This requires innovative results. And this requires clear and convincing presentation of said results. Presentation can make the difference of one tier in journal level. A lot of useful advice on this topic is available online from well-respected outlets; see, for example, El-Omar (2014); Gould (2014); Neiles et al. (2015); Notz and Kafadar (2011); or Sachdeva (2020). This text provides a different angle.
Lifecycle models for research data are often abstract and simple. This comes at the danger of oversimplifying the complex concepts of research data management. The analysis of 90 different lifecycle models lead to two approaches to assess the quality of these models. While terminological issues make direct comparisons of models hard, an empirical evaluation seems possible.
We propose the use of beamplots - which can be produced by using the R package BibPlots and WoS downloads - as a preferred alternative to h index values for assessing single researchers.
Daniel Torres-Salinas, Christian Gumpenberger, Juan Gorraiz
The main purpose of this macro-study is to shed light on the broad impact of books. For this purpose, the impact of a very large collection of books has been analyzed by using PlumX, an analytical tool providing a great number of different metrics provided by various tools.
A toy-model of publications and citations processes is proposed. The model shows that the role of randomness in the processes is essential and cannot be ignored. Some other aspects of scientific publications rating are discussed.
The importance of dimensional analysis and dimensional homogeneity in bibliometric studies is always overlooked. In this paper, we look at this issue systematically and show that most h-type indices have the dimensions of [P], where [P] is the basic dimensional unit in bibliometrics which is the unit publication or paper. The newly introduced Euclidean index, based on the Euclidean length of the citation vector has the dimensions [P3/2]. An empirical example is used to illustrate the concepts.
In this comment, I discuss the use of statistical inference in citation analysis. In a recent paper, Williams and Bornmann argue in favor of the use of statistical inference in citation analysis. I present a critical analysis of their arguments and of similar arguments provided elsewhere in the literature. My conclusion is that the use of statistical inference in citation analysis involves major conceptual difficulties and, consequently, that the usefulness of statistical inference in citation analysis is highly questionable.
The social network analysis of bibliometric data needs matrices to be recast in a network framework. In this paper we argue that a simple conservation rule requires that this should be done only using fractional counting so that conservation at the paper level will be faithfully reproduced at higher levels ofaggregation (i.e. author, institute, country, journal etc.) of the complex network.
The reverse transcription paradigm shift in RNA tumor virus research marked by the discovery of the reverse transcriptase in 1970 was traced using co-citation and title word frequency analysis. It is shown that this event is associated with a break in citation patterns and the occurrence of previously unknown technical terms.
The author of this letter to the editor would like to set forth the argument that scientometrics is currently in a phase in which a taxonomic change, and hence a revolution, is taking place. One of the key terms in scientometrics is scientific impact which nowadays is understood to mean not only the impact on science but the impact on every area of society.
One of the main tasks of the European Digital Mathematics Library project was to define a cooperation model with a variety of stakeholders that would allow building a reliable and durable global reference library, aiming to be eventually exhaustive. In this paper we present the EuDML external cooperation model and the business plan as the basis for its sustainability and further development.
One way of evaluating individual scientists is the determination of the number of highly cited publications, where the threshold is given by a large reference set. It is shown that this indicator behaves in a counterintuitive way, leading to inconsistencies in the ranking of different scientists.
In this short communication I highlight how the number of collaborators on papers in the main astronomy journals has evolved over time. We see a trend of moving away from single-author papers. This communication is based on data in the holdings of the SAO/NASA Astrophysics Data System (ADS). The ADS is funded by NASA Grant NNX09AB39G.
Wikis provide a new way of collaboration and knowledge sharing. Wikis are software that allows users to work collectively on a web-based knowledge base. Wikis are characterised by a sense of anarchism, collaboration, connectivity, organic development and self-healing, and they rely on trust. We list several concerns about applying wikis in professional organisation. After these concerns are met, wikis can provide a progessive, new knowledge sharing and collaboration tool.