T. W. Anderson, H. Rubin
Hasil untuk "Probabilities. Mathematical statistics"
Menampilkan 20 dari ~1724766 hasil · dari DOAJ, arXiv, CrossRef, Semantic Scholar
Nur Silviyah Rahmi, Suci Astutik, Ni Wayan Surya Wardhani et al.
Hypertension is a common degenerative disease with a high mortality rate and a significant impact on quality of life and productivity. Education level plays a crucial role in understanding and managing hypertension, where higher education levels can contribute to reducing the risk of hypertension. This study utilized meta-analysis and meta-regression to explore the relationship between education level and hypertension prevalence. Secondary data from eight previous studies conducted between 2015 and 2023 were analyzed. Heterogeneity analysis was performed to determine the appropriate meta-analysis model, with a random-effect model selected based on the test results. Of the eight studies analyzed, five showed a negative odds ratio, indicating that individuals with higher education levels have a lower likelihood of developing hypertension compared to those with lower education levels. The heterogeneity test showed significant variability among the studies (I2 = 91.38%). The random-effect model estimated a combined effect size with an ln odds ratio of -0.1777 and a 95% confidence interval of -0.3228 to -0.0326. These findings suggest that higher education levels are associated with a lower risk of hypertension. This underscores the importance of improving access to quality education as part of public health strategies to reduce the incidence of hypertension and enhance overall community well-being.
Lilis Harianti Hasibuan, Ferra Yanuar, Dodi Devianto et al.
The poverty line is the threshold income level below which a person or household is considered to be living in poverty. The poverty line is a representation of the minimum rupiah amount needed to meet the minimum basic food needs equivalent to 2100 kilocalories per capita per day and basic non-food needs. According to data from the Central Bureau of Statistics (BPS), although the poverty rate in West Sumatra has decreased in recent years, the issue of poverty is still very relevant to be discussed and addressed. The issue of the poverty line is important to discuss because it is directly related to the welfare of people and the development of a country. For modeling the poverty line and its influencing factors, appropriate statistical methods are needed. This research is about the comparison of two methods, namely the Bayesian quantile regression method and Bayesian LASSO quantile regression. The two methods are compared with the aim of seeing which method produces the smallest error. Bayesian quantile regression is one method that can model data assuming heteroscedasticity violations. This study compares the ordinary Bayesian quantile regression method with penalized LASSO. These two methods are applied in modeling the poverty line in West Sumatra. The purpose of this study is to see the best method for modeling data. The data used amounted to 133 data points from BPS in the years 2017 and 2023. Model parameters were estimated using MCMC with a Gibbs sampling approach. The results show that the Bayesian LASSO method is superior to the method without LASSO. This is evidenced that the superior method produces the smallest MSE value, 0.208, at quantile 0.5. Model poverty line in West Sumatra is significantly influenced by per capita spending ), Gross Regional Domestic Product ), Human Development Index ), Open Unemployment Rate , and minimum wages .
Muthia Nadhira Faladiba, Atina Ahdika
In a research study, population data are often not available, so the population parameter is unknown. Meanwhile, knowledge about the population parameter is needed to know the characteristics of the studied population. Therefore, it is needed to estimate the parameter of the population which can be estimated by sample data. There are several methods of parameter estimation which are generally classified into classical and Bayesian method. This research studied the Bayesian parameter estimation method to determine the parameters of the exponentially distributed survival data associated with the reliability measure of the estimates under symmetric and asymmetric loss functions for complete sample data in a closed form. The symmetric loss functions used in this research are Squared Error Loss Function (SELF) and Minimum Expected Loss Function (MELF). The asymmetric loss functions used are the General Entropy Loss Function (GELF) and Linex Loss Function (LLF). Performance of some loss functions used in this research are then compared through numerical simulation to select the best loss function in determining the parameter estimation of the exponentially distributed survival data. We also studied which loss function is best for underestimation and overestimation modeling. Based on simulation results, the Bayes estimates using MELF is the best method to estimate population parameters of the exponentially distributed survival data for the overestimation modeling, while LLF is the best for the underestimation modeling. We provided direct application in a case study of fluorescence lamp survival data. The results show that the best method to estimate the parameter of the standard fluorescence life data is using LLF for underestimation with and MELF for overestimation with .
Fachira Haneinanda Junianto, Adji Achmad Rinaldo Fernandes, Solimun Solimun et al.
This research aims to determine regional economic improvement to achieve a better Indonesian economy and accelerate the path to achieving a Golden Indonesia in 2045 so that it can be realized in a shorter time. This goal will be achieved with the help of statistical analysis methods, where the analysis used in this research is semiparametric truncated spline indirect effect and total effect analysis. The research becomes original in its approach with the utilization of this method and offers novel insights into the dynamics of regional economic development in Indonesia. These methods in this research serve as a tool for analyzing regional economic dynamics, identifying critical factors for improvement, informing policy decisions aimed at realizing Indonesia's economic aspirations for the future, and providing more flexible results to achieve the research objectives. The study was carried out on data with regional expenditure variables as exogenous variables, labor absorption variables as mediating endogenous variables, and regional economic growth variables as pure endogenous variables. The data used in the research are data published by the National/Provincial Central Bureau of Statistics in the form of the Indonesian Statistics Book, BPS publications in the form of Provinces, Provincial Government Financial Statistics, Directorate General of Financial Balance, Sumreg Bappenas, as well as from Ministries, Institutions or Agencies that related to providing data relating to the variables of this research in 2020. The results of this research are that the relationship between regional expenditure variables and labor absorption variables has a significant effect on regional economic growth variables.
Harish Chandra Yadav, Abhilasha Yadav, Susheel Kumar
This paper presents two approximations of the solution functions of Abel’s integral equations belong ing to classes Hα[0,1), Hϕ[0,1) by (λk+1 −1,M)th partial sums of their second kind Chebyshev wavelet expansion in the interval [0,1), for λ > 1. These approximations are E(1) λk+1−1,M (f), E(2) λk+1−1,M (f). Chebyshev wavelets of the second kind were used to solve Abel’s integral equations. The Chebyshev wavelet of the second kind leads to a solution that is almost identical to their exact solution. This research paper’s accomplishment in wavelet analysis is noteworthy.
Happy Alyzhya Haay, Suryasatriya Trihandaru, Bambang Susanto
In this research, the face painting recognition of Papua and Papua New Guinea was identified using the Convolutional Neural Network (CNN). This CNN method is one of the deep learning that is very well known and widely used in face recognition. The best training process model is obtained using the CNN architecture, namely ResNet-50, VGG-16, and VGG-19. The results obtained from the training model obtained an accuracy of 80.57% for the ResNet-50 model, 100% for the VGG-16 model, and 99.57% for the VGG-19 model. After the training process, predictions were continued using architectural models with test data. The prediction results obtained show that the accuracy of the ResNet-50 model is 0.70, the VGG-16 model is 0.82, and the VGG-19 model is 0.83. It means that the CNN architectural model that has the best performance in making predictions in identifying the recognition of Papua and Papua New Guinea's face painting is the VGG-19 model because the accuracy value obtained is 0.83.
A.R. Yeshkeyev, A.R. Yarullina, S.M. Amanbekov
The article is devoted to the study of semantic Jonsson quasivarieties of universal unars and undirected graphs. The first section of the article consists of basic necessary concepts from Jonsson model theory. The following two sections are results of using new notions of semantic Jonsson quasivariety of Robinson unars JCU and semantic Jonsson quasivariety of Robinson undirected graphs JCG, its elementary theory and semantic model. In order to prove two main results of the paper, Robinson spectra RSp(JCU) and RSp(JCG) and their partition onto equivalence classes [∆]U and [∆]G by cosemanticness relation were considered. The main results are presented in the form of theorems 11 and 13 and imply following useful corollaries: countably categorical Robinson theories of unars are totally categorical; countably categorical Robinson theories of undirected graphs are totally categorical. The obtained results can be useful for continuation of the various Jonsson algebras’ research, particularly semantic Jonsson quasivariety of S-acts over cyclic monoid.
Izza Dinikal Arsy, Dedi Rosadi
Risk-averse investors will seek out stock investments with the minimum risk. One step that can be taken is to develop a model of stock prices and predict their fluctuations in the coming months. Significant studies on the modeling of stock movements have used the ARCH/GARCH method, but this method requires some assumptions. This paper will discuss the performance of stock modeling using Support Vector Regression. The performance is measured using the root mean square error value in two stock clusters based on its volatility value, e.g., stocks with large volatility and stocks with small volatility. This case study makes use of daily closing price data from 10 LQ-45 index shares from October 12, 2018 to October 11, 2019. In conclusion, SVR's performance on stocks with high volatility produces RMSE, which is considerably higher than SVR's performance on stocks with low volatility.
Rochelle E. Tractenberg, Donna LaLonde, Suzanne Thornton
Consensus based publications of both competencies and undergraduate curriculum guidance documents targeting data science instruction for higher education have recently been published. Recommendations for curriculum features from diverse sources may not result in consistent training across programs. A Mastery Rubric was developed that prioritizes the promotion and documentation of formal growth as well as the development of independence needed for the 13 requisite knowledge, skills, and abilities for professional practice in statistics and data science, SDS. The Mastery Rubric, MR, driven curriculum can emphasize computation, statistics, or a third discipline in which the other would be deployed or, all three can be featured. The MR SDS supports each of these program structures while promoting consistency with international, consensus based, curricular recommendations for statistics and data science, and allows 'statistics', 'data science', and 'statistics and data science' curricula to consistently educate students with a focus on increasing learners independence. The Mastery Rubric construct integrates findings from the learning sciences, cognitive and educational psychology, to support teachers and students through the learning enterprise. The MR SDS will support higher education as well as the interests of business, government, and academic work force development, bringing a consistent framework to address challenges that exist for a domain that is claimed to be both an independent discipline and part of other disciplines, including computer science, engineering, and statistics. The MR-SDS can be used for development or revision of an evaluable curriculum that will reliably support the preparation of early e.g., undergraduate degree programs, middle e.g., upskilling and training programs, and late e.g., doctoral level training practitioners.
M. Opper, David Saad
M. Alizadeh, I. Ghosh, H. Yousof et al.
We introduce a new class of distributions called the generalized odd generalized exponential family. Some of its mathematical properties including explicit expressions for the ordinary and incomplete moments, quantile and generating functions, R ?́?nyi, Shannon and q-entropies, order statistics and probability weighted moments are derived. We also propose bivariate generalizations. We constructed a simple type Copula and intro-duced a useful stochastic property. The maximum likelihood method is used for estimating the model parameters. The importance and flexibility of the new family are illustrated by means of two applications to real data sets. We assess the performance of the maximum likelihood estimators in terms of biases and mean squared errors via a simulation study.
M. Mansour, M. Rasekhi, M. Ibrahim et al.
In this paper, we first study a new two parameter lifetime distribution. This distribution includes “monotone” and “non-monotone” hazard rate functions which are useful in lifetime data analysis and reliability. Some of its mathematical properties including explicit expressions for the ordinary and incomplete moments, generating function, Renyi entropy, δ-entropy, order statistics and probability weighted moments are derived. Non-Bayesian estimation methods such as the maximum likelihood, Cramer-Von-Mises, percentile estimation, and L-moments are used for estimating the model parameters. The importance and flexibility of the new distribution are illustrated by means of two applications to real data sets. Using the approach of the Bagdonavicius–Nikulin goodness-of-fit test for the right censored validation, we then propose and apply a modified chi-square goodness-of-fit test for the Burr X Weibull model. The modified goodness-of-fit statistics test is applied for the right censored real data set. Based on the censored maximum likelihood estimators on initial data, the modified goodness-of-fit test recovers the loss in information while the grouped data follows the chi-square distribution. The elements of the modified criteria tests are derived. A real data application is for validation under the uncensored scheme.
Sumin Sumin, Heri Retnawati
The Central Statistics Agency has published a survey report on the happiness of the Indonesian people in 2017. The survey results show that there are disparities that vary between provinces. The province with the highest happiness index was North Maluku, while the province with the lowest happiness index was Papua. Based on this phenomenon, the researcher wants to map the provinces based on the similarity of happiness levels. Researchers used quantitative descriptive methods with data analysis using multidimensional scaling. The results show that the provinces that have similarities with the happiest group are: [1] North Maluku province is like Riau Islands, Gorontalo, North Sulawesi, and Maluku. [2] South Kalimantan is like North Kalimantan, East Kalimantan, DI Yogyakarta, and Bali. [3] DKI Jakarta is like West Papua. [4] South Sulawesi is like West Sumatra, Riau, and South Sumatra. [5] Aceh is like Kep. Bangka Belitung. The less happy group [1] West Java is like Banten, Central Java, Central Kalimantan, Jambi, and East Java. [2] North Sumatra is like Papua. [3] Central Sulawesi is like Southeast Sulawesi, West Nusa Tenggara, Bengkulu, West Kalimantan, West Sulawesi, Lampung, and East Nusa Tenggara.
Ben Dai, Xiaotong Shen, Lin Yee Chen et al.
In explainable artificial intelligence, discriminative feature localization is critical to reveal a blackbox model's decision-making process from raw data to prediction. In this article, we use two real datasets, the MNIST handwritten digits and MIT-BIH Electrocardiogram (ECG) signals, to motivate key characteristics of discriminative features, namely adaptiveness, predictive importance and effectiveness. Then, we develop a localization framework based on adversarial attacks to effectively localize discriminative features. In contrast to existing heuristic methods, we also provide a statistically guaranteed interpretability of the localized features by measuring a generalized partial $R^2$. We apply the proposed method to the MNIST dataset and the MIT-BIH dataset with a convolutional auto-encoder. In the first, the compact image regions localized by the proposed method are visually appealing. Similarly, in the second, the identified ECG features are biologically plausible and consistent with cardiac electrophysiological principles while locating subtle anomalies in a QRS complex that may not be discernible by the naked eye. Overall, the proposed method compares favorably with state-of-the-art competitors. Accompanying this paper is a Python library dnn-locate (https://dnn-locate.readthedocs.io/en/latest/) that implements the proposed approach.
Zhipeng Wang, D. W. Scott
Density estimation is one of the central areas of statistics whose purpose is to estimate the probability density function underlying the observed data. It serves as a building block for many tasks in statistical inference, visualization, and machine learning. Density estimation is widely adopted in the domain of unsupervised learning especially for the application of clustering. As big data become pervasive in almost every area of data sciences, analyzing high‐dimensional data that have many features and variables appears to be a major focus in both academia and industry. High‐dimensional data pose challenges not only from the theoretical aspects of statistical inference, but also from the algorithmic/computational considerations of machine learning and data analytics. This paper reviews a collection of selected nonparametric density estimation algorithms for high‐dimensional data, some of them are recently published and provide interesting mathematical insights. The important application domain of nonparametric density estimation, such as modal clustering, is also included in this paper. Several research directions related to density estimation and high‐dimensional data analysis are suggested by the authors.
Szabolcs Suveges, Ibrahim Chamseddine, Katarzyna A. Rejniak et al.
The specific structure of the extracellular matrix (ECM), and in particular the density and orientation of collagen fibres, plays an important role in the evolution of solid cancers. While many experimental studies discussed the role of ECM in individual and collective cell migration, there are still unanswered questions about the impact of nonlocal cell sensing of other cells on the overall shape of tumour aggregation and its migration type. There are also unanswered questions about the migration and spread of tumour that arises at the boundary between different tissues with different collagen fibre orientations. To address these questions, in this study we develop a hybrid multi-scale model that considers the cells as individual entities and ECM as a continuous field. The numerical simulations obtained through this model match experimental observations, confirming that tumour aggregations are not moving if the ECM fibres are distributed randomly, and they only move when the ECM fibres are highly aligned. Moreover, the stationary tumour aggregations can have circular shapes or irregular shapes (with finger-like protrusions), while the moving tumour aggregations have elongate shapes (resembling to clusters, strands or files). We also show that the cell sensing radius impacts tumour shape only when there is a low ratio of fibre to non-fibre ECM components. Finally, we investigate the impact of different ECM fibre orientations corresponding to different tissues, on the overall tumour invasion of these neighbouring tissues.
Manuel Hohmann, Christian Pfeifer, Nicoleta Voicu
The paper introduces a general mathematical framework for action based field theories on Finsler spacetimes. As most often fields on Finsler spacetime (e.g., the Finsler fundamental function or the resulting metric tensor) have a homogeneous dependence on the tangent directions of spacetime, we construct the appropriate configuration bundles whose sections are such homogeneous fields; on these configuration bundles, the tools of coordinate free calculus of variations can be consistently applied to obtain field equations. Moreover, we prove that general covariance of natural Finsler field Lagrangians leads to an averaged energy-momentum conservation law which, in the particular case of Lorentzian spacetimes, is equivalent to the usual, pointwise energy-momentum covariant conservation law.
Guillaume Chauvet
Nizam Uddin, Mohamad S. Hasan
A new test statistic is proposed to test the equality of two normal means when the data is partially paired and partially unpaired. The test statistic is based on a linear combination of the differences of both paired and unpaired sample means. Using t-distribution as the approximate null distribution, the proposed method is evaluated against some other standard methods known in the literature. For samples from normal and logistic distributions with equal variances, the proposed method appears to perform better than other methods with respect to power while keeping the type I error rates very competitive.
Halaman 18 dari 86239