Hasil untuk "Infectious and parasitic diseases"

Menampilkan 20 dari ~1374656 hasil · dari arXiv, CrossRef

JSON API
arXiv Open Access 2025
A Novel CNN Gradient Boosting Ensemble for Guava Disease Detection

Tamim Ahasan Rijon, Yeasin Arafath

As a significant agricultural country, Bangladesh utilizes its fertile land for guava cultivation and dedicated labor to boost its economic development. In a nation like Bangladesh, enhancing guava production and agricultural practices plays a crucial role in its economy. Anthracnose and fruit fly infection can lower the quality and productivity of guava, a crucial tropical fruit. Expert systems that detect diseases early can reduce losses and safeguard the harvest. Images of guava fruits classified into the Healthy, Fruit Flies, and Anthracnose classes are included in the Guava Fruit Disease Dataset 2024 (GFDD24), which comes from plantations in Rajshahi and Pabna, Bangladesh. This study aims to create models using CNN alongside traditional machine learning techniques that can effectively identify guava diseases in locally cultivated varieties in Bangladesh. In order to achieve the highest classification accuracy of approximately 99.99% for the guava dataset, we propose utilizing ensemble models that combine CNNML with Gradient Boosting Machine. In general, the CNN-ML cascade framework exhibits strong, high-accuracy guava disease detection that is appropriate for real-time agricultural monitoring systems.

en cs.CV, cs.LG
arXiv Open Access 2025
Discovery of Disease Relationships via Transcriptomic Signature Analysis Powered by Agentic AI

Ke Chen, Haohan Wang

Modern disease classification often overlooks molecular commonalities hidden beneath divergent clinical presentations. This study introduces a transcriptomics-driven framework for discovering disease relationships by analyzing over 1300 disease-condition pairs using GenoMAS, a fully automated agentic AI system. Beyond identifying robust gene-level overlaps, we develop a novel pathway-based similarity framework that integrates multi-database enrichment analysis to quantify functional convergence across diseases. The resulting disease similarity network reveals both known comorbidities and previously undocumented cross-category links. By examining shared biological pathways, we explore potential molecular mechanisms underlying these connections-offering functional hypotheses that go beyond symptom-based taxonomies. We further show how background conditions such as obesity and hypertension modulate transcriptomic similarity, and identify therapeutic repurposing opportunities for rare diseases like autism spectrum disorder based on their molecular proximity to better-characterized conditions. In addition, this work demonstrates how biologically grounded agentic AI can scale transcriptomic analysis while enabling mechanistic interpretation across complex disease landscapes. All results are publicly accessible at github.com/KeeeeChen/Pathway_Similarity_Network.

en q-bio.GN, cs.LG
arXiv Open Access 2025
Chronic Diseases Prediction using Machine Learning and Deep Learning Methods

Houda Belhad, Asmae Bourbia, Salma Boughanja

Chronic diseases, such as cardiovascular disease, diabetes, chronic kidney disease, and thyroid disorders, are the leading causes of premature mortality worldwide. Early detection and intervention are crucial for improving patient outcomes, yet traditional diagnostic methods often fail due to the complex nature of these conditions. This study explores the application of machine learning (ML) and deep learning (DL) techniques to predict chronic disease and thyroid disorders. We used a variety of models, including Logistic Regression (LR), Random Forest (RF), Gradient Boosted Trees (GBT), Neural Networks (NN), Decision Trees (DT) and Native Bayes (NB), to analyze and predict disease outcomes. Our methodology involved comprehensive data pre-processing, including handling missing values, categorical encoding, and feature aggregation, followed by model training and evaluation. Performance metrics such ad precision, recall, accuracy, F1-score, and Area Under the Curve (AUC) were used to assess the effectiveness of each model. The results demonstrated that ensemble methods like Random Forest and Gradient Boosted Trees consistently outperformed. Neutral Networks also showed superior performance, particularly in capturing complex data patterns. The findings highlight the potential of ML and DL in revolutionizing chronic disease prediction, enabling early diagnosis and personalized treatment strategies. However, challenges such as data quality, model interpretability, and the need for advanced computational techniques in healthcare to improve patient outcomes and reduce the burden of chronic diseases. This study was conducted as part of Big Data class project under the supervision of our professors Mr. Abderrahmane EZ-ZAHOUT and Mr. Abdessamad ESSAIDI.

en cs.LG
arXiv Open Access 2025
ParaGate: Parasitic-Driven Domain Adaptation Transfer Learning for Netlist Performance Prediction

Bin Sun, Jingyi Zhou, Jianan Mu et al.

In traditional EDA flows, layout-level performance metrics are only obtainable after placement and routing, hindering global optimization at earlier stages. Although some neural-network-based solutions predict layout-level performance directly from netlists, they often face generalization challenges due to the black-box heuristics of commercial placement-and-routing tools, which create disparate data across designs. To this end, we propose ParaGate, a three-step cross-stage prediction framework that infers layout-level timing and power from netlists. First, we propose a two-phase transfer-learning approach to predict parasitic parameters, pre-training on mid-scale circuits and fine-tuning on larger ones to capture extreme conditions. Next, we rely on EDA tools for timing analysis, offloading the long-path numerical reasoning. Finally, ParaGate performs global calibration using subgraph features. Experiments show that ParaGate achieves strong generalization with minimal fine-tuning data: on openE906, its arrival-time R2 from 0.119 to 0.897. These results demonstrate that ParaGate could provide guidance for global optimization in the synthesis and placement stages.

en cs.LG, cs.AI
arXiv Open Access 2025
A cell-level model to predict the spatiotemporal dynamics of neurodegenerative disease

Shih-Huan Huang, Matthew W. Cotton, Tuomas P. J. Knowles et al.

A central challenge in modeling neurodegenerative diseases is connecting cellular-level mechanisms to tissue-level pathology, in particular to determine whether pathology is driven primarily by cell-autonomous triggers or by propagation from cells that are already in a pathological, runaway aggregation state. To bridge this gap, we here develop a bottom-up physical model that explicitly incorporates these two fundamental cell-level drivers of protein aggregation dynamics. We show that our model naturally explains the characteristic long, slow development of pathology followed by a rapid acceleration, a hallmark of many neurodegenerative diseases. Furthermore, the model reveals the existence of a critical switch point at which the system's dynamics transition from being dominated by slow, spontaneous formation of diseased cells to being driven by fast propagation. This framework provides a robust physical foundation for interpreting pathological data and offers a method to predict which class of therapeutic strategies is best matched to the underlying drivers of a specific disease.

en q-bio.QM, cond-mat.soft
arXiv Open Access 2024
Enhancing Biomedical Knowledge Discovery for Diseases: An Open-Source Framework Applied on Rett Syndrome and Alzheimer's Disease

Christos Theodoropoulos, Andrei Catalin Coman, James Henderson et al.

The ever-growing volume of biomedical publications creates a critical need for efficient knowledge discovery. In this context, we introduce an open-source end-to-end framework designed to construct knowledge around specific diseases directly from raw text. To facilitate research in disease-related knowledge discovery, we create two annotated datasets focused on Rett syndrome and Alzheimer's disease, enabling the identification of semantic relations between biomedical entities. Extensive benchmarking explores various ways to represent relations and entity representations, offering insights into optimal modeling strategies for semantic relation detection and highlighting language models' competence in knowledge discovery. We also conduct probing experiments using different layer representations and attention scores to explore transformers' ability to capture semantic relations.

en cs.CL, cs.AI
arXiv Open Access 2024
Prediction and Detection of Terminal Diseases Using Internet of Medical Things: A Review

Akeem Temitope Otapo, Alice Othmani, Ghazaleh Khodabandelou et al.

The integration of Artificial Intelligence (AI) and the Internet of Medical Things (IoMT) in healthcare, through Machine Learning (ML) and Deep Learning (DL) techniques, has advanced the prediction and diagnosis of chronic diseases. AI-driven models such as XGBoost, Random Forest, CNNs, and LSTM RNNs have achieved over 98\% accuracy in predicting heart disease, chronic kidney disease (CKD), Alzheimer's disease, and lung cancer, using datasets from platforms like Kaggle, UCI, private institutions, and real-time IoMT sources. However, challenges persist due to variations in data quality, patient demographics, and formats from different hospitals and research sources. The incorporation of IoMT data, which is vast and heterogeneous, adds complexities in ensuring interoperability and security to protect patient privacy. AI models often struggle with overfitting, performing well in controlled environments but less effectively in real-world clinical settings. Moreover, multi-morbidity scenarios especially for rare diseases like dementia, stroke, and cancers remain insufficiently addressed. Future research should focus on data standardization and advanced preprocessing techniques to improve data quality and interoperability. Transfer learning and ensemble methods are crucial for improving model generalizability across clinical settings. Additionally, the exploration of disease interactions and the development of predictive models for chronic illness intersections is needed. Creating standardized frameworks and open-source tools for integrating federated learning, blockchain, and differential privacy into IoMT systems will also ensure robust data privacy and security.

en cs.LG
arXiv Open Access 2023
Dataset Optimization for Chronic Disease Prediction with Bio-Inspired Feature Selection

Abeer Dyoub, Ivan Letteri

In this study, we investigated the application of bio-inspired optimization algorithms, including Genetic Algorithm, Particle Swarm Optimization, and Whale Optimization Algorithm, for feature selection in chronic disease prediction. The primary goal was to enhance the predictive accuracy of models streamline data dimensionality, and make predictions more interpretable and actionable. The research encompassed a comparative analysis of the three bio-inspired feature selection approaches across diverse chronic diseases, including diabetes, cancer, kidney, and cardiovascular diseases. Performance metrics such as accuracy, precision, recall, and f1 score are used to assess the effectiveness of the algorithms in reducing the number of features needed for accurate classification. The results in general demonstrate that the bio-inspired optimization algorithms are effective in reducing the number of features required for accurate classification. However, there have been variations in the performance of the algorithms on different datasets. The study highlights the importance of data pre-processing and cleaning in ensuring the reliability and effectiveness of the analysis. This study contributes to the advancement of predictive analytics in the realm of chronic diseases. The potential impact of this work extends to early intervention, precision medicine, and improved patient outcomes, providing new avenues for the delivery of healthcare services tailored to individual needs. The findings underscore the potential benefits of using bio-inspired optimization algorithms for feature selection in chronic disease prediction, offering valuable insights for improving healthcare outcomes.

en cs.NE, cs.LG
arXiv Open Access 2022
MAIScope: A low-cost portable microscope with built-in vision AI to automate microscopic diagnosis of diseases in remote rural settings

Rohan Sangameswaran

According to the World Health Organization(WHO), malaria is estimated to have killed 627,000 people and infected over 241 million people in 2020 alone, a 12% increase from 2019. Microscopic diagnosis of blood cells is the standard testing procedure to diagnose malaria. However, this style of diagnosis is expensive, time-consuming, and greatly subjective to human error, especially in developing nations that lack well-trained personnel to perform high-quality microscopy examinations. This paper proposes Mass-AI-Scope (MAIScope): a novel, low-cost, portable device that can take microscopic images and automatically detect malaria parasites with embedded AI. The device has two subsystems. The first subsystem is an on-device multi-layered deep learning network, that detects red blood cells (RBCs) from microscopic images, followed by a malaria parasite classifier that recognizes malaria parasites in the individual RBCs. The testing and validation demonstrated a high average accuracy of 89.9% for classification and average precision of 61.5% for detection models using TensorFlow Lite while addressing limited storage and computational capacity. This system also has cloud synchronization, which sends images to the cloud when connected to the Internet for analysis and model improvement purposes. The second subsystem is the hardware which consists of components like Raspberry Pi, a camera, a touch screen display, and an innovative low-cost bead microscope. Evaluation of the bead microscope demonstrated similar image quality with that of expensive light microscopes. The device is designed to be portable and work in remote environments without the Internet or power. The solution is extensible to other diseases requiring microscopy and can help standardize automation of disease diagnosis in rural parts of developing nations.

en eess.IV, cs.CV
arXiv Open Access 2021
Deep Neural Network Based Ensemble learning Algorithms for the healthcare system (diagnosis of chronic diseases)

Jafar Abdollahi, Babak Nouri-Moghaddam, Mehdi Ghazanfari

learning algorithms. In this paper, we review the classification algorithms used in the health care system (chronic diseases) and present the neural network-based Ensemble learning method. We briefly describe the commonly used algorithms and describe their critical properties. Materials and Methods: In this study, modern classification algorithms used in healthcare, examine the principles of these methods and guidelines, and to accurately diagnose and predict chronic diseases, superior machine learning algorithms with the neural network-based ensemble learning Is used. To do this, we use experimental data, real data on chronic patients (diabetes, heart, cancer) available on the UCI site. Results: We found that group algorithms designed to diagnose chronic diseases can be more effective than baseline algorithms. It also identifies several challenges to further advancing the classification of machine learning in the diagnosis of chronic diseases. Conclusion: The results show the high performance of the neural network-based Ensemble learning approach for the diagnosis and prediction of chronic diseases, which in this study reached 98.5, 99, and 100% accuracy, respectively.

en cs.LG, cs.AI
arXiv Open Access 2021
Mathematical modeling of shear-activated targeted nanoparticle drug delivery for the treatment of aortic diseases

Yonghui Qiao, Yan Wang, Yanlu Chen et al.

The human aorta is a high-risk area for vascular diseases, which are commonly restored by thoracic endovascular aortic repair. In this paper, we report a promising shear-activated targeted nanoparticle drug delivery strategy to assist in the treatment of coarctation of the aorta and aortic aneurysm. Idealized three-dimensional geometric models of coarctation of the aorta and aortic aneurysm are designed, respectively. The unique hemodynamic environment of the diseased aorta is used to improve nanoparticle drug delivery. Micro-carriers with nanoparticle drugs would be targeting activated to release nanoparticle drugs by local abnormal shear stress rate (SSR). Coarctation of the aorta provides a high SSR hemodynamic environment, while the aortic aneurysm is exposed to low SSR. We propose a method to calculate the SSR thresholds for the diseased aorta. Results show that the upstream near-wall area of the diseased location is an ideal injection location for the micro-carriers, which could be activated by the abnormal SSR. Released nanoparticle drugs would be successfully targeted delivered to the aortic diseased wall. Besides, the high diffusivity of the micro-carriers and nanoparticle drugs has a significant impact on the surface drug concentrations of the diseased aortic walls, especially for aortic aneurysms. This study preliminary demonstrates the feasibility of shear-activated targeted nanoparticle drug delivery in the treatment of aortic diseases and provides a theoretical basis for developing the drug delivery system and novel therapy.

en physics.med-ph
arXiv Open Access 2020
Unfolding selection to infer individual risk heterogeneity for optimising disease forecasts and policy development

M. Gabriela M. Gomes, Nicholas A. Feasey, Marcelo U. Ferreira et al.

Mathematical models are increasing adopted for setting targets for disease prevention and control. As model-informed policies are implemented, however, the inaccuracies of some forecasts become apparent, for example overprediction of infection burdens and overestimation of intervention impacts. Here, we attribute these discrepancies to methodological limitations in capturing the heterogeneities of real-world systems. The mechanisms underpinning single factors for infection and their interactions determine individual propensities to acquire disease. These are potentially so numerous that to attain a full mechanistic description may be unfeasible. To contribute constructively to the development of health policies, model developers either leave factors out (reductionism) or adopt a broader but coarse description (holism). In our view, predictive capacity requires holistic descriptions of heterogeneity which are currently underutilised in infectious disease epidemiology but common in other disciplines.

en q-bio.PE
arXiv Open Access 2020
A guided network propagation approach to identify disease genes that combines prior and new information

Borislav H. Hristov, Bernard Chazelle, Mona Singh

A major challenge in biomedical data science is to identify the causal genes underlying complex genetic diseases. Despite the massive influx of genome sequencing data, identifying disease-relevant genes remains difficult as individuals with the same disease may share very few, if any, genetic variants. Protein-protein interaction networks provide a means to tackle this heterogeneity, as genes causing the same disease tend to be proximal within networks. Previously, network propagation approaches have spread signal across the network from either known disease genes or genes that are newly putatively implicated in the disease (e.g., found to be mutated in exome studies or linked via genome-wide association studies). Here we introduce a general framework that considers both sources of data within a network context. Specifically, we use prior knowledge of disease-associated genes to guide random walks initiated from genes that are newly identified as perhaps disease-relevant. In large-scale testing across 24 cancer types, we demonstrate that our approach for integrating both prior and new information not only better identifies cancer driver genes than using either source of information alone but also readily outperforms other state-of-the-art network-based approaches. To demonstrate the versatility of our approach, we also apply it to genome-wide association data to identify genes functionally relevant for several complex diseases. Overall, our work suggests that guided network propagation approaches that utilize both prior and new data are a powerful means to identify disease genes.

en q-bio.GN, q-bio.MN

Halaman 51 dari 68733