Hasil untuk "machine learning"

Menampilkan 20 dari ~10323585 hasil · dari DOAJ, Semantic Scholar, CrossRef

JSON API
S2 Open Access 2019
Advances and Open Problems in Federated Learning

P. Kairouz, H. B. McMahan, Brendan Avent et al.

Federated learning (FL) is a machine learning setting where many clients (e.g. mobile devices or whole organizations) collaboratively train a model under the orchestration of a central server (e.g. service provider), while keeping the training data decentralized. FL embodies the principles of focused data collection and minimization, and can mitigate many of the systemic privacy risks and costs resulting from traditional, centralized machine learning and data science approaches. Motivated by the explosive growth in FL research, this paper discusses recent advances and presents an extensive collection of open problems and challenges.

8199 sitasi en Computer Science, Mathematics
S2 Open Access 2017
Machine Learning Methods for Histopathological Image Analysis

D. Komura, S. Ishikawa

Abundant accumulation of digital histopathological images has led to the increased demand for their analysis, such as computer-aided diagnosis using machine learning techniques. However, digital pathological images and related tasks have some issues to be considered. In this mini-review, we introduce the application of digital pathological image analysis using machine learning algorithms, address some problems specific to such analysis, and propose possible solutions.

812 sitasi en Computer Science, Medicine
S2 Open Access 2017
Ten quick tips for machine learning in computational biology

D. Chicco

Machine learning has become a pivotal tool for many projects in computational biology, bioinformatics, and health informatics. Nevertheless, beginners and biomedical researchers often do not have enough experience to run a data mining project effectively, and therefore can follow incorrect practices, that may lead to common mistakes or over-optimistic results. With this review, we present ten quick tips to take advantage of machine learning in any computational biology context, by avoiding some common errors that we observed hundreds of times in multiple bioinformatics projects. We believe our ten suggestions can strongly help any machine learning practitioner to carry on a successful project in computational biology and related sciences.

763 sitasi en Medicine, Computer Science
S2 Open Access 2017
Data Mining and Analytics in the Process Industry: The Role of Machine Learning

Zhiqiang Ge, Zhihuan Song, S. Ding et al.

Data mining and analytics have played an important role in knowledge discovery and decision making/supports in the process industry over the past several decades. As a computational engine to data mining and analytics, machine learning serves as basic tools for information extraction, data pattern recognition and predictions. From the perspective of machine learning, this paper provides a review on existing data mining and analytics applications in the process industry over the past several decades. The state-of-the-art of data mining and analytics are reviewed through eight unsupervised learning and ten supervised learning algorithms, as well as the application status of semi-supervised learning algorithms. Several perspectives are highlighted and discussed for future researches on data mining and analytics in the process industry.

743 sitasi en Computer Science
S2 Open Access 2008
Faster and Better: A Machine Learning Approach to Corner Detection

E. Rosten, R. Porter, T. Drummond

The repeatability and efficiency of a corner detector determines how likely it is to be useful in a real-world application. The repeatability is important because the same scene viewed from different positions should yield features which correspond to the same real-world 3D locations. The efficiency is important because this determines whether the detector combined with further processing can operate at frame rate. Three advances are described in this paper. First, we present a new heuristic for feature detection and, using machine learning, we derive a feature detector from this which can fully process live PAL video using less than 5 percent of the available processing time. By comparison, most other detectors cannot even operate at frame rate (Harris detector 115 percent, SIFT 195 percent). Second, we generalize the detector, allowing it to be optimized for repeatability, with little loss of efficiency. Third, we carry out a rigorous comparison of corner detectors based on the above repeatability criterion applied to 3D scenes. We show that, despite being principally constructed for speed, on these stringent tests, our heuristic detector significantly outperforms existing feature detectors. Finally, the comparison demonstrates that using machine learning produces significant improvements in repeatability, yielding a detector that is both very fast and of very high quality.

1996 sitasi en Computer Science, Medicine
S2 Open Access 2018
A strategy to apply machine learning to small datasets in materials science

Ying Zhang, Chen Ling

There is growing interest in applying machine learning techniques in the research of materials science. However, although it is recognized that materials datasets are typically smaller and sometimes more diverse compared to other fields, the influence of availability of materials data on training machine learning models has not yet been studied, which prevents the possibility to establish accurate predictive rules using small materials datasets. Here we analyzed the fundamental interplay between the availability of materials data and the predictive capability of machine learning models. Instead of affecting the model precision directly, the effect of data size is mediated by the degree of freedom (DoF) of model, resulting in the phenomenon of association between precision and DoF. The appearance of precision–DoF association signals the issue of underfitting and is characterized by large bias of prediction, which consequently restricts the accurate prediction in unknown domains. We proposed to incorporate the crude estimation of property in the feature space to establish ML models using small sized materials data, which increases the accuracy of prediction without the cost of higher DoF. In three case studies of predicting the band gap of binary semiconductors, lattice thermal conductivity, and elastic properties of zeolites, the integration of crude estimation effectively boosted the predictive capability of machine learning models to state-of-art levels, demonstrating the generality of the proposed strategy to construct accurate machine learning models using small materials dataset.MACHINE LEARNING: Dealing with small datasetsMachine learning can be useful for materials prediction if crude estimations of the outcome are integrated in the code. Machine learning has been attracting tremendous attention lately due to its predictive power; evidence suggests it is directly proportional to the size of the available datasets. Machine learning can be useful in predicting new materials and novel properties, but materials sets tend to be smaller and more diverse than other fields. Ying Zhang and Chen Ling from the Toyota Research Institute of North America report that these small datasets affect the freedom of the algorithms and thus limit their predictive capabilities. In order to counterbalance the effect, they suggest introducing in the code crude estimations of the targeted property, obtained by other means.

657 sitasi en
S2 Open Access 2017
Survey of Machine Learning Algorithms for Disease Diagnostic

M. Fatima, Maruf Pasha

In medical imaging, Computer Aided Diagnosis (CAD) is a rapidly growing dynamic area of research. In recent years, significant attempts are made for the enhancement of computer aided diagnosis applications because errors in medical diagnostic systems can result in seriously misleading medical treatments. Machine learning is important in Computer Aided Diagnosis. After using an easy equation, objects such as organs may not be indicated accurately. So, pattern recognition fundamentally involves learning from examples. In the field of bio-medical, pattern recognition and machine learning promise the improved accuracy of perception and diagnosis of disease. They also promote the objectivity of decision-making process. For the analysis of high-dimensional and multimodal bio-medical data, machine learning offers a worthy approach for making classy and automatic algorithms. This survey paper provides the comparative analysis of different machine learning algorithms for diagnosis of different diseases such as heart disease, diabetes disease, liver disease, dengue disease and hepatitis disease. It brings attention towards the suite of machine learning algorithms and tools that are used for the analysis of diseases and decision-making process accordingly.

596 sitasi en Computer Science
S2 Open Access 2019
Machine learning applications in epilepsy

B. Abbasi, D. Goldenholz

Machine learning leverages statistical and computer science principles to develop algorithms capable of improving performance through interpretation of data rather than through explicit instructions. Alongside widespread use in image recognition, language processing, and data mining, machine learning techniques have received increasing attention in medical applications, ranging from automated imaging analysis to disease forecasting. This review examines the parallel progress made in epilepsy, highlighting applications in automated seizure detection from electroencephalography (EEG), video, and kinetic data, automated imaging analysis and pre‐surgical planning, prediction of medication response, and prediction of medical and surgical outcomes using a wide variety of data sources. A brief overview of commonly used machine learning approaches, as well as challenges in further application of machine learning techniques in epilepsy, is also presented. With increasing computational capabilities, availability of effective machine learning algorithms, and accumulation of larger datasets, clinicians and researchers will increasingly benefit from familiarity with these techniques and the significant progress already made in their application in epilepsy.

297 sitasi en Medicine, Computer Science
S2 Open Access 2019
Machine learning and complex biological data

Chunming Xu, S. Jackson

Machine learning has demonstrated potential in analyzing large, complex biological data. In practice, however, biological information is required in addition to machine learning for successful application.

292 sitasi en Medicine, Biology
S2 Open Access 2019
Machine Learning Techniques for Biomedical Image Segmentation: An Overview of Technical Aspects and Introduction to State-of-Art Applications

Hyunseok Seo, M. B. Khuzani, V. Vasudevan et al.

In recent years, significant progress has been made in developing more accurate and efficient machine learning algorithms for segmentation of medical and natural images. In this review article, we highlight the imperative role of machine learning algorithms in enabling efficient and accurate segmentation in the field of medical imaging. We specifically focus on several key studies pertaining to the application of machine learning methods to biomedical image segmentation. We review classical machine learning algorithms such as Markov random fields, k-means clustering, random forest, etc. Although such classical learning models are often less accurate compared to the deep-learning techniques, they are often more sample efficient and have a less complex structure. We also review different deep-learning architectures, such as the artificial neural networks (ANNs), the convolutional neural networks (CNNs), and the recurrent neural networks (RNNs), and present the segmentation results attained by those learning models that were published in the past 3 yr. We highlight the successes and limitations of each machine learning paradigm. In addition, we discuss several challenges related to the training of different machine learning models, and we present some heuristics to address those challenges.

265 sitasi en Computer Science, Engineering
S2 Open Access 2019
Optimization Models for Machine Learning: A Survey

Claudio Gambella, Bissan Ghaddar, Joe Naoum-Sawaya

Abstract This paper surveys the machine learning literature and presents in an optimization framework several commonly used machine learning approaches. Particularly, mathematical optimization models are presented for regression, classification, clustering, deep learning, and adversarial learning, as well as new emerging applications in machine teaching, empirical model learning, and Bayesian network structure learning. Such models can benefit from the advancement of numerical optimization techniques which have already played a distinctive role in several machine learning settings. The strengths and the shortcomings of these models are discussed and potential research directions and open problems are highlighted.

237 sitasi en Computer Science, Mathematics
DOAJ Open Access 2025
Rapid and Non-Destructive Assessment of Eight Essential Amino Acids in Foxtail Millet: Development of an Efficient and Accurate Detection Model Based on Near-Infrared Hyperspectral

Anqi Gao, Xiaofu Wang, Erhu Guo et al.

Foxtail millet is a vital grain whose amino acid content affects nutritional quality. Traditional detection methods are destructive, time-consuming, and inefficient. This work established a rapid and non-destructive method for detecting essential amino acids in the foxtail millet. To address these limitations, this study developed a rapid, non-destructive approach for quantifying eight essential amino acids—lysine, phenylalanine, methionine, threonine, isoleucine, leucine, valine, and histidine—in foxtail millet (variety: Changnong No. 47) using near-infrared hyperspectral imaging. A total of 217 samples were collected and used for model development. The spectral data were preprocessed using Savitzky–Golay, adaptive iteratively reweighted penalized least squares, and standard normal variate. The key wavelengths were extracted using the competitive adaptive reweighted sampling algorithm, and four regression models—Partial Least Squares Regression (PLSR), Support Vector Regression (SVR), Convolutional Neural Network (CNN), and Bidirectional Long Short-Term Memory (BiLSTM)—were constructed. The results showed that the key wavelengths selected by CARS account for only 2.03–4.73% of the full spectrum. BiLSTM was most suitable for modeling lysine (R<sup>2</sup> = 0.5862, RMSE = 0.0081, RPD = 1.6417). CNN demonstrated the best performance for phenylalanine, methionine, isoleucine, and leucine. SVR was most effective for predicting threonine (R<sup>2</sup> = 0.8037, RMSE = 0.0090, RPD = 2.2570), valine, and histidine. This study offers an effective novel approach for intelligent quality assessment of grains.

Chemical technology

Halaman 20 dari 516180