BioGPT: Generative Pre-trained Transformer for Biomedical Text Generation and Mining
Renqian Luo, Liai Sun, Yingce Xia
et al.
Pre-trained language models have attracted increasing attention in the biomedical domain, inspired by their great success in the general natural language domain. Among the two main branches of pre-trained language models in the general language domain, i.e. BERT (and its variants) and GPT (and its variants), the first one has been extensively studied in the biomedical domain, such as BioBERT and PubMedBERT. While they have achieved great success on a variety of discriminative downstream biomedical tasks, the lack of generation ability constrains their application scope. In this paper, we propose BioGPT, a domain-specific generative Transformer language model pre-trained on large-scale biomedical literature. We evaluate BioGPT on six biomedical natural language processing tasks and demonstrate that our model outperforms previous models on most tasks. Especially, we get 44.98%, 38.42% and 40.76% F1 score on BC5CDR, KD-DTI and DDI end-to-end relation extraction tasks, respectively, and 78.2% accuracy on PubMedQA, creating a new record. Our case study on text generation further demonstrates the advantage of BioGPT on biomedical literature to generate fluent descriptions for biomedical terms.
1291 sitasi
en
Computer Science, Medicine
Structural Equation Modeling
Chandan Singh, J. S. Khamba
The chapters demonstrate two SEM programs with distinct user interfaces and capabilities (Amos and Mplus) with enough specificity that readers can conduct their own analyses without consulting additional resources. Examples from social work literature highlight best practices for the specification, estimation, interpretation, and modification of structural equation models. Oftentimes, confirmatory factor analysis and general structure modeling are the most flexible, powerful, and appropriate choices for social work data.
5536 sitasi
en
Physics, Mathematics
Neural Message Passing for Quantum Chemistry
J. Gilmer, S. Schoenholz, Patrick F. Riley
et al.
Supervised learning on molecules has incredible potential to be useful in chemistry, drug discovery, and materials science. Luckily, several promising and closely related neural network models invariant to molecular symmetries have already been described in the literature. These models learn a message passing algorithm and aggregation procedure to compute a function of their entire input graph. At this point, the next step is to find a particularly effective variant of this general approach and apply it to chemical prediction benchmarks until we either solve them or reach the limits of the approach. In this paper, we reformulate existing models into a single common framework we call Message Passing Neural Networks (MPNNs) and explore additional novel variations within this framework. Using MPNNs we demonstrate state of the art results on an important molecular property prediction benchmark; these results are strong enough that we believe future work should focus on datasets with larger molecules or more accurate ground truth labels.
8765 sitasi
en
Computer Science
How ‘Big Data’ Can Make Big Impact: Findings from a Systematic Review and a Longitudinal Case Study
S. Wamba, S. Wamba, Shahriar Akter
et al.
1667 sitasi
en
Computer Science
A New Dimension of Health Care: Systematic Review of the Uses, Benefits, and Limitations of Social Media for Health Communication
A. Moorhead, Diane E Hazlett, L. Harrison
et al.
Background There is currently a lack of information about the uses, benefits, and limitations of social media for health communication among the general public, patients, and health professionals from primary research. Objective To review the current published literature to identify the uses, benefits, and limitations of social media for health communication among the general public, patients, and health professionals, and identify current gaps in the literature to provide recommendations for future health communication research. Methods This paper is a review using a systematic approach. A systematic search of the literature was conducted using nine electronic databases and manual searches to locate peer-reviewed studies published between January 2002 and February 2012. Results The search identified 98 original research studies that included the uses, benefits, and/or limitations of social media for health communication among the general public, patients, and health professionals. The methodological quality of the studies assessed using the Downs and Black instrument was low; this was mainly due to the fact that the vast majority of the studies in this review included limited methodologies and was mainly exploratory and descriptive in nature. Seven main uses of social media for health communication were identified, including focusing on increasing interactions with others, and facilitating, sharing, and obtaining health messages. The six key overarching benefits were identified as (1) increased interactions with others, (2) more available, shared, and tailored information, (3) increased accessibility and widening access to health information, (4) peer/social/emotional support, (5) public health surveillance, and (6) potential to influence health policy. Twelve limitations were identified, primarily consisting of quality concerns and lack of reliability, confidentiality, and privacy. Conclusions Social media brings a new dimension to health care as it offers a medium to be used by the public, patients, and health professionals to communicate about health issues with the possibility of potentially improving health outcomes. Social media is a powerful tool, which offers collaboration between users and is a social interaction mechanism for a range of individuals. Although there are several benefits to the use of social media for health communication, the information exchanged needs to be monitored for quality and reliability, and the users’ confidentiality and privacy need to be maintained. Eight gaps in the literature and key recommendations for future health communication research were provided. Examples of these recommendations include the need to determine the relative effectiveness of different types of social media for health communication using randomized control trials and to explore potential mechanisms for monitoring and enhancing the quality and reliability of health communication using social media. Further robust and comprehensive evaluation and review, using a range of methodologies, are required to establish whether social media improves health communication practice both in the short and long terms.
Review of the psychometric evidence of the perceived stress scale.
Eun-Hyun Lee
PURPOSE The purpose of this study was to review articles related to the psychometric properties of the Perceived Stress Scale (PSS). METHODS Systematic literature searches of computerized databases were performed to identify articles on psychometric evaluation of the PSS. RESULTS The search finally identified 19 articles. Internal consistency reliability, factorial validity, and hypothesis validity of the PSS were well reported. However, the test-retest reliability and criterion validity were relatively rarely evaluated. In general, the psychometric properties of the 10-item PSS were found to be superior to those of the 14-item PSS, while those of the 4-item scale fared the worst. The psychometric properties of the PSS have been evaluated empirically mostly using populations of college students or workers. CONCLUSION Overall, the PSS is an easy-to-use questionnaire with established acceptable psychometric properties. However, future studies should evaluate these psychometric properties in greater depth, and validate the scale using diverse populations.
2090 sitasi
en
Medicine, Psychology
Differential Privacy: A Survey of Results
C. Dwork
4078 sitasi
en
Computer Science
Answering the Call for a Standard Reliability Measure for Coding Data
Andrew F. Hayes, K. Krippendorff
4268 sitasi
en
Computer Science
Toward A Hierarchical Model of Intrinsic and Extrinsic Motivation
R. Vallerand
3373 sitasi
en
Psychology
On sequential Monte Carlo sampling methods for Bayesian filtering
A. Doucet, S. Godsill, C. Andrieu
4928 sitasi
en
Mathematics, Computer Science
Polymorphic transitions in single crystals: A new molecular dynamics method
M. Parrinello, A. Rahman
Strategies for Theorizing from Process Data
A. Langley
An Autoregressive Distributed Lag Modelling Approach to Cointegration Analysis
M Hashem Pesaran, Y. Shin, P. Boswijk
et al.
Learning and Teaching Styles in Engineering Education.
R. Felder, L. Silverman
5737 sitasi
en
Psychology
Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders
A. Hamosh, A. F. Scott, J. Amberger
et al.
Online Mendelian Inheritance in Man (OMIM™) is a comprehensive, authoritative and timely knowledgebase of human genes and genetic disorders compiled to support human genetics research and education and the practice of clinical genetics. Started by Dr Victor A. McKusick as the definitive reference Mendelian Inheritance in Man, OMIM (http://www.ncbi.nlm.nih.gov/omim/) is now distributed electronically by the National Center for Biotechnology Information, where it is integrated with the Entrez suite of databases. Derived from the biomedical literature, OMIM is written and edited at Johns Hopkins University with input from scientists and physicians around the world. Each OMIM entry has a full-text summary of a genetically determined phenotype and/or gene and has numerous links to other genetic databases such as DNA and protein sequence, PubMed references, general and locus-specific mutation databases, HUGO nomenclature, MapViewer, GeneTests, patient support groups and many others. OMIM is an easy and straightforward portal to the burgeoning information in human genetics.
3362 sitasi
en
Biology, Medicine
PlantCARE, a database of plant cis-acting regulatory elements and a portal to tools for in silico analysis of promoter sequences
M. Lescot, P. Déhais, G. Thijs
et al.
6299 sitasi
en
Biology, Computer Science
The Financial Accelerator in a Quantitative Business Cycle Framework
B. Bernanke, M. Gertler, M. Gertler
et al.
Matching As An Econometric Evaluation Estimator
J. Heckman, Hidehiko Ichimura, Petra E. Todd
4102 sitasi
en
Mathematics
Conversational Exploration of Literature Landscape with LitChat
Mingyu Huang, Shasha Zhou, Yuxuan Chen
et al.
We are living in an era of "big literature", where the volume of digital scientific publications is growing exponentially. While offering new opportunities, this also poses challenges for understanding literature landscapes, as traditional manual reviewing is no longer feasible. Recent large language models (LLMs) have shown strong capabilities for literature comprehension, yet they are incapable of offering "comprehensive, objective, open and transparent" views desired by systematic reviews due to their limited context windows and trust issues like hallucinations. Here we present LitChat, an end-to-end, interactive and conversational literature agent that augments LLM agents with data-driven discovery tools to facilitate literature exploration. LitChat automatically interprets user queries, retrieves relevant sources, constructs knowledge graphs, and employs diverse data-mining techniques to generate evidence-based insights addressing user needs. We illustrate the effectiveness of LitChat via a case study on AI4Health, highlighting its capacity to quickly navigate the users through large-scale literature landscape with data-based evidence that is otherwise infeasible with traditional means.
AI Realtor: Towards Grounded Persuasive Language Generation for Automated Copywriting
Jibang Wu, Chenghao Yang, Yi Wu
et al.
This paper develops an agentic framework that employs large language models (LLMs) for grounded persuasive language generation in automated copywriting, with real estate marketing as a focal application. Our method is designed to align the generated content with user preferences while highlighting useful factual attributes. This agent consists of three key modules: (1) Grounding Module, mimicking expert human behavior to predict marketable features; (2) Personalization Module, aligning content with user preferences; (3) Marketing Module, ensuring factual accuracy and the inclusion of localized features. We conduct systematic human-subject experiments in the domain of real estate marketing, with a focus group of potential house buyers. The results demonstrate that marketing descriptions generated by our approach are preferred over those written by human experts by a clear margin while maintaining the same level of factual accuracy. Our findings suggest a promising agentic approach to automate large-scale targeted copywriting while ensuring factuality of content generation.