Hasil untuk "Automation"

Menampilkan 20 dari ~850457 hasil · dari arXiv, DOAJ, Semantic Scholar, CrossRef

JSON API
arXiv Open Access 2025
GitHub Marketplace: Driving Automation and Fostering Innovation in Software Development

SK. Golam Saroar, Waseefa Ahmed, Elmira Onagh et al.

GitHub, a central hub for collaborative software development, has revolutionized the open-source software (OSS) ecosystem through its GitHub Marketplace, a platform launched in 2017 to host automation tools aimed at enhancing the efficiency and scalability of software projects. As the adoption of automation in OSS production grows, understanding the trends, characteristics, and underlying dynamics of this marketplace has become vital. Furthermore, despite the rich repository of academic research on software automation, a disconnect persists between academia and industry practices. This study seeks to bridge this gap by providing a systematic analysis of the GitHub Marketplace, comparing trends observed in industry tools with advancements reported in academic literature, and identifying areas where academia can contribute to practical innovation.

en cs.SE
arXiv Open Access 2025
Evaluating the Economic Feasibility of Labor Replacement Through Robotics and Automation in Qatar

Tariq Eldakruri, Edip Senyurek

This paper investigates the economic feasibility of replacing human labor with robotics and automation in Qatar's manufacturing and service sectors. By analyzing labor costs, productivity gains, and implementation expenses, the study assesses the potential financial impact and return on investment of robotic integration. Results indicate the sectors where automation is economically viable and identify challenges related to workforce adaptation, policy, and infrastructure. These insights provide guidance for policymakers and industry stakeholders considering automation strategies in Qatar.

arXiv Open Access 2025
Agent-Initiated Interaction in Phone UI Automation

Noam Kahlon, Guy Rom, Anatoly Efros et al.

Phone automation agents aim to autonomously perform a given natural-language user request, such as scheduling appointments or booking a hotel. While much research effort has been devoted to screen understanding and action planning, complex tasks often necessitate user interaction for successful completion. Aligning the agent with the user's expectations is crucial for building trust and enabling personalized experiences. This requires the agent to proactively engage the user when necessary, avoiding actions that violate their preferences while refraining from unnecessary questions where a default action is expected. We argue that such subtle agent-initiated interaction with the user deserves focused research attention. To promote such research, this paper introduces a task formulation for detecting the need for user interaction and generating appropriate messages. We thoroughly define the task, including aspects like interaction timing and the scope of the agent's autonomy. Using this definition, we derived annotation guidelines and created AndroidInteraction, a diverse dataset for the task, leveraging an existing UI automation dataset. We tested several text-based and multimodal baseline models for the task, finding that it is very challenging for current LLMs. We suggest that our task formulation, dataset, baseline models and analysis will be valuable for future UI automation research, specifically in addressing this crucial yet often overlooked aspect of agent-initiated interaction. This work provides a needed foundation to allow personalized agents to properly engage the user when needed, within the context of phone UI automation.

en cs.HC
arXiv Open Access 2025
GAIR: GUI Automation via Information-Joint Reasoning and Group Reflection

Zishu Wei, Qixiang Ma, Xavier Hu et al.

Building AI systems for GUI automation task has attracted remarkable research efforts, where MLLMs are leveraged for processing user requirements and give operations. However, GUI automation includes a wide range of tasks, from document processing to online shopping, from CAD to video editing. Diversity between particular tasks requires MLLMs for GUI automation to have heterogeneous capabilities and master multidimensional expertise, raising problems on constructing such a model. To address such challenge, we propose GAIR: GUI Automation via Information-Joint Reasoning and Group Reflection, a novel MLLM-based GUI automation agent framework designed for integrating knowledge and combining capabilities from heterogeneous models to build GUI automation agent systems with higher performance. Since different GUI-specific MLLMs are trained on different dataset and thus have different strengths, GAIR introduced a general-purpose MLLM for jointly processing the information from multiple GUI-specific models, further enhancing performance of the agent framework. The general-purpose MLLM also serves as decision maker, trying to execute a reasonable operation based on previously gathered information. When the general-purpose model thinks that there isn't sufficient information for a reasonable decision, GAIR would transit into group reflection status, where the general-purpose model would provide GUI-specific models with different instructions and hints based on their strengths and weaknesses, driving them to gather information with more significance and accuracy that can support deeper reasoning and decision. We evaluated the effectiveness and reliability of GAIR through extensive experiments on GUI benchmarks.

en cs.MA, cs.AI
arXiv Open Access 2025
GATE: An Integrated Assessment Model for AI Automation

Ege Erdil, Andrei Potlogea, Tamay Besiroglu et al.

Assessing the economic impacts of artificial intelligence requires integrating insights from both computer science and economics. We present the Growth and AI Transition Endogenous model (GATE), a dynamic integrated assessment model that simulates the economic effects of AI automation. GATE combines three key ingredients that have not been brought together in previous work: (1) a compute-based model of AI development, (2) an AI automation framework, and (3) a semi-endogenous growth model featuring endogenous investment and adjustment costs. The model allows users to simulate the economic effects of the transition to advanced AI across a range of potential scenarios. GATE captures the interactions between economic variables, including investment, automation, innovation, and growth, as well as AI-related inputs such as compute and algorithms. This paper explains the model's structure and functionality, emphasizing AI development for economists and economic modeling for the AI community. The model is implemented in an interactive sandbox, enabling users to explore the impact of AI under different parameter choices and policy interventions. The modeling sandbox is available at: www.epoch.ai/GATE.

en econ.GN
DOAJ Open Access 2025
Leveraging IoT, digital twin and machine learning for smart energy audit in office building: a systematic literature review and recommendations

Ali Zaenal Abidin, I Ketut Agung Enriko, Aloysius Adya Pramudita

Energy audits play a pivotal role in improving energy efficiency and reducing carbon emissions in office buildings. However, conventional audits often suffer from fragmented insights, lack of system-level monitoring, establishing energy baseline, and insufficient incorporation of occupant behavior. To address these challenges, this study conducts a systematic literature review of recent applications of Internet of Things (IoT), machine learning (ML), and digital twin (DT) technologies in the energy audit domain. The review, guided by PRISMA methodology, analyzes eleven selected studies published between 2022 and 2024, revealing that while ML dominates in predictive modeling, IoT and DT remain underutilized in delivering integrated, efficiency recommendations. The analysis identifies three key engineering gaps: limited use of occupant behavior data, absence of continuous energy baseline modeling, and lack of systems capable of generating real-time efficiency recommendations. In response, this paper proposes a novel AIoT-based energy audit framework that combines real-time monitoring via IoT with ML-driven analytics and optimization, supported optionally by DT-based simulation. The proposed framework aims to enable continuous, system-level audits aligned with ISO 50000 standards, offering practical pathways for building managers to diagnose inefficiencies and implement energy-saving actions. Validating the model in real-world office environments, expanding input variables, and integration strategy with building automation systems are further important study to realize intelligent and scalable energy audit solutions.

Electrical engineering. Electronics. Nuclear engineering
DOAJ Open Access 2025
Frequency domain estimation method of the characteristic period of the P wave of earthquakes

Codrin Donciu, Elena Serea, Marinel Costel Temneanu

Early warning system earthquake alerts exploit the time delay that the surface waves have in reference to the P waves and estimate the magnitude based on the interpretation of the specific parameters of the P waves. One of the most commonly used parameters for estimating the moment magnitude of an earthquake is the characteristic period measured in the first 3 s after the appearance of the P wave. The classic method determines the characteristic period in the time domain by using the velocity and displacement waves of the acquired samples. In this paper, we present a new method for estimating the characteristic period through its corresponding frequency. This method includes zero padding of the P-wave sequence, conversion of the extended sequence from the time domain to the frequency domain, identification of local frequency maxima, and calculation of the weighted average of the frequency based on the identified maxima. Tests conducted on synthetic signals, as well as standard deviation evaluation tests for simultaneous recordings at several seismic stations, revealed better performance than the classic method in terms of noise immunity and number of false alarms.

DOAJ Open Access 2025
On BESS Capacity Optimization of Hybrid Coal-Fired Generator and BESS Power Station for Secondary Frequency Regulation

Xinsong Zhang, Ziyun Ma, Beiping Gu et al.

Integrating battery energy storage systems (BESS) into a coal-fired generator can enhance power systems’ secondary frequency regulation capability. To this end, this paper proposes a policy to coordinate the BESS and the coal-fired generator to meet the automatic generation control (AGC) requirements and subsequently investigates the optimal BESS capacity to maximize the net profit gained from the frequency regulation. Firstly, probability characteristics of AGC instruction duration period, interval period, regulation rate, and regulation direction are investigated according to the real AGC datalog. The AGC regulation direction would change randomly. Directly switching the BESS between charging and discharging to match the requirement would significantly deplete the lifetime of the battery. Therefore, this paper divides the BESS into two groups, which will be controlled stay in charging and discharging states to respectively respond to the AGC requirement of “up” and “down”. To obtain a cost-effective BESS investment, this paper develops a new sizing method, which optimizes the BESS capacity by simulating the operation of the hybrid coal-fired generator and BESS power station (HCGBPS) over an example day. Finally, the case studies justified the developed sizing method could make satisfying BESS investment decisions to ensure a maximum net profit in operation.

Electrical engineering. Electronics. Nuclear engineering
DOAJ Open Access 2025
An improved deep CNN-based freshwater fish classification with cascaded bio-inspired networks

Asadullah Shaikh, Wahidur Rahman, Kaniz Roksana et al.

Bangladesh has plentiful water, which is essential to its freshwater fish traditions. Environmental concerns and other causes have reduced the country's water resources, threatening many native freshwater fish species. Thus, the younger generation deficiencies recognition of local freshwater fish and struggles to recognize them. Traditional methods are very insufficient to overcome these issues. To address these research gaps, the research proposes an automatic system for categorizing Bangladesh's freshwater fish. The proposed methodology involves several key steps, including building a comprehensive dataset, extracting features from fish images using pre-trained Convolutional Neural Network (CNN) models, and employing typical ML approaches. Initially comprising eight classes, the dataset undergoes feature extraction using CNN algorithms, followed by the utilization of various feature selection methods such as Support Vector Classifier, Principal Component Analysis, Linear Discriminant Analysis, and optimization models like Particle Swarm Optimization, Bacterial Foraging Optimization, and Cat Swarm Optimization. In the final phase, seven conventional ML techniques are applied to classify the images of local fishes. Empirical measurements are gathered and analyzed to assess the proposed framework's performance. Particularly, the present approach achieves the highest accuracy of 98.71% through the utilization of the ML mechanism Logistic Regression with Resnet50, SVC, and CSO models.

Control engineering systems. Automatic machinery (General), Automation
arXiv Open Access 2024
Fine-Tuning and Prompt Engineering for Large Language Models-based Code Review Automation

Chanathip Pornprasit, Chakkrit Tantithamthavorn

Context: The rapid evolution of Large Language Models (LLMs) has sparked significant interest in leveraging their capabilities for automating code review processes. Prior studies often focus on developing LLMs for code review automation, yet require expensive resources, which is infeasible for organizations with limited budgets and resources. Thus, fine-tuning and prompt engineering are the two common approaches to leveraging LLMs for code review automation. Objective: We aim to investigate the performance of LLMs-based code review automation based on two contexts, i.e., when LLMs are leveraged by fine-tuning and prompting. Fine-tuning involves training the model on a specific code review dataset, while prompting involves providing explicit instructions to guide the model's generation process without requiring a specific code review dataset. Method: We leverage model fine-tuning and inference techniques (i.e., zero-shot learning, few-shot learning and persona) on LLMs-based code review automation. In total, we investigate 12 variations of two LLMs-based code review automation (i.e., GPT- 3.5 and Magicoder), and compare them with the Guo et al.'s approach and three existing code review automation approaches. Results: The fine-tuning of GPT 3.5 with zero-shot learning helps GPT-3.5 to achieve 73.17% -74.23% higher EM than the Guo et al.'s approach. In addition, when GPT-3.5 is not fine-tuned, GPT-3.5 with few-shot learning achieves 46.38% - 659.09% higher EM than GPT-3.5 with zero-shot learning. Conclusions: Based on our results, we recommend that (1) LLMs for code review automation should be fine-tuned to achieve the highest performance; and (2) when data is not sufficient for model fine-tuning (e.g., a cold-start problem), few-shot learning without a persona should be used for LLMs for code review automation.

en cs.SE
arXiv Open Access 2024
Automating the Enterprise with Foundation Models

Michael Wornow, Avanika Narayan, Krista Opsahl-Ong et al.

Automating enterprise workflows could unlock $4 trillion/year in productivity gains. Despite being of interest to the data management community for decades, the ultimate vision of end-to-end workflow automation has remained elusive. Current solutions rely on process mining and robotic process automation (RPA), in which a bot is hard-coded to follow a set of predefined rules for completing a workflow. Through case studies of a hospital and large B2B enterprise, we find that the adoption of RPA has been inhibited by high set-up costs (12-18 months), unreliable execution (60% initial accuracy), and burdensome maintenance (requiring multiple FTEs). Multimodal foundation models (FMs) such as GPT-4 offer a promising new approach for end-to-end workflow automation given their generalized reasoning and planning abilities. To study these capabilities we propose ECLAIR, a system to automate enterprise workflows with minimal human supervision. We conduct initial experiments showing that multimodal FMs can address the limitations of traditional RPA with (1) near-human-level understanding of workflows (93% accuracy on a workflow understanding task) and (2) instant set-up with minimal technical barrier (based solely on a natural language description of a workflow, ECLAIR achieves end-to-end completion rates of 40%). We identify human-AI collaboration, validation, and self-improvement as open challenges, and suggest ways they can be solved with data management techniques. Code is available at: https://github.com/HazyResearch/eclair-agents

en cs.SE, cs.AI
DOAJ Open Access 2024
Integrating routine blood biomarkers and artificial intelligence for supporting diagnosis of silicosis in engineered stone workers

Daniel Sanchez‐Morillo, Antonio León‐Jiménez, María Guerrero‐Chanivet et al.

Abstract Engineered stone silicosis (ESS), primarily caused by inhaling respirable crystalline silica, poses a significant occupational health risk globally. ESS has no effective treatment and presents a rapid progression from simple silicosis (SS) to progressive massive fibrosis (PMF), with respiratory failure and death. Despite the use of diagnostic methods like chest x‐rays and high‐resolution computed tomography, early detection of silicosis remains challenging. Since routine blood tests have shown promise in detecting inflammatory markers associated with the disease, this study aims to assess whether routine blood biomarkers, coupled with machine learning techniques, can effectively differentiate between healthy individuals, subjects with SS, and PMF. To this end, 107 men diagnosed with silicosis, ex‐workers in the engineered stone (ES) sector, and 22 healthy male volunteers as controls not exposed to ES dust were recruited. Twenty‐one primary biochemical markers derived from peripheral blood extraction were obtained retrospectively from clinical hospital records. Relief‐F features selection technique was applied, and the resulting subset of 11 biomarkers was used to build five machine learning models, demonstrating high performance with sensitivities and specificities in the best case greater than 82% and 89%, respectively. The percentage of lymphocytes, the angiotensin‐converting enzyme, and lactate dehydrogenase indexes were revealed, among others, as blood biomarkers with significant cumulative importance for the machine learning models. Our study reveals that these biomarkers could detect a chronic inflammatory status and potentially serve as a supportive tool for the diagnosis, monitoring, and early detection of the progression of silicosis.

Chemical engineering, Biotechnology
DOAJ Open Access 2024
An Apple Detection and Localization Method for Automated Harvesting under Adverse Light Conditions

Guoyu Zhang, Ye Tian, Wenhan Yin et al.

The use of automation technology in agriculture has become particularly important as global agriculture is challenged by labor shortages and efficiency gains. The automated process for harvesting apples, an important agricultural product, relies on efficient and accurate detection and localization technology to ensure the quality and quantity of production. Adverse lighting conditions can significantly reduce the accuracy of fruit detection and localization in automated apple harvesting. Based on deep-learning techniques, this study aims to develop an accurate fruit detection and localization method under adverse light conditions. This paper explores the LE-YOLO model for accurate and robust apple detection and localization. The traditional YOLOv5 network was enhanced by adding an image enhancement module and an attention mechanism. Additionally, the loss function was improved to enhance detection performance. Secondly, the enhanced network was integrated with a binocular camera to achieve precise apple localization even under adverse lighting conditions. This was accomplished by calculating the 3D coordinates of feature points using the binocular localization principle. Finally, detection and localization experiments were conducted on the established dataset of apples under adverse lighting conditions. The experimental results indicate that LE-YOLO achieves higher accuracy in detection and localization compared to other target detection models. This demonstrates that LE-YOLO is more competitive in apple detection and localization under adverse light conditions. Compared to traditional manual and general automated harvesting, our method enables automated work under various adverse light conditions, significantly improving harvesting efficiency, reducing labor costs, and providing a feasible solution for automation in the field of apple harvesting.

Agriculture (General)
S2 Open Access 2016
Field Monitoring and Automation Using IOT in Agriculture Domain

I. Mohanraj, K. Ashokumar, Prof. Naren.J

Abstract Agriculture sector in India is diminishing day by day which affects the production capacity of ecosystem. There is an exigent need to solve the problem in the domain to restore vibrancy and put it back on higher growth. The paper proposes an e-Agriculture Application based on the framework consisting of KM-Knowledge base and Monitoring modules. To make profitable decisions, farmers need information throughout the entire farming cycle. The required information is scattered in various places which includes real time information such as market prices and current production level stats along with the available primary crop knowledge. A knowledge dataflow model is constructed connecting various scattered sources to the crop structures. The world around is getting automated replacing manual procedures with the advancement of technology, since it is energy efficient and engross minimal man power. The paper proposes the advantages of having ICT in Indian agricultural sector, which shows the path for rural farmers to replace some of the conventional techniques. Monitoring modules are demonstrated using various sensors for which the inputs are fed from Knowledge base. A prototype of the mechanism is carried out using TI CC3200 Launchpad interconnected sensors modules with other necessary electronic devices. A comparative study is made between the developed system and the existing systems. The system overcomes limitations of traditional agricultural procedures by utilizing water resource efficiently and also reducing labour cost.

244 sitasi en Computer Science
arXiv Open Access 2023
The Ethics of Automating Legal Actors

Josef Valvoda, Alec Thompson, Ryan Cotterell et al.

The introduction of large public legal datasets has brought about a renaissance in legal NLP. Many of these datasets are comprised of legal judgements - the product of judges deciding cases. This fact, together with the way machine learning works, means that several legal NLP models are models of judges. While some have argued for the automation of judges, in this position piece, we argue that automating the role of the judge raises difficult ethical challenges, in particular for common law legal systems. Our argument follows from the social role of the judge in actively shaping the law, rather than merely applying it. Since current NLP models come nowhere close to having the facilities necessary for this task, they should not be used to automate judges. Furthermore, even in the case the models could achieve human-level capabilities, there would still be remaining ethical concerns inherent in the automation of the legal process.

en cs.CL, cs.AI
arXiv Open Access 2023
DBNet: Leveraging DBMS for Network Automation

Rithvik Chuppala, Silvery Fu, Sylvia Ratnasamy

We present DBNet, a data-driven network automation framework built on top of a DBMS. DBNet utilizes key primitives of a DBMS including tables, procedures, transactions, logging, and access control to serve the functions of a data-centric network control plane. DBNet accomplishes this functionality by storing mirrored network device states, executing automation programs on these mirror states within the DBMS, and proxying state updates out to the physical devices upon changes to mirror/local state. The framework also stores network telemetry data, performs analytics on the data, uses the analytics to motivate control plane actions, and provides provenance logging features on the actions taken. We apply DBNet to motivating cloud network infrastructure examples and show how developers can use DBNet's interface to express rich user-defined policies. Our preliminary case studies show that the overhead to run DBNet is trivial in the timescales generally relevant for network automation.

en cs.NI
arXiv Open Access 2023
Time-Sensitive Networking (TSN) for Industrial Automation: Current Advances and Future Directions

Tianyu Zhang, Gang Wang, Chuanyu Xue et al.

With the introduction of Cyber-Physical Systems (CPS) and Internet of Things (IoT) technologies, the automation industry is undergoing significant changes, particularly in improving production efficiency and reducing maintenance costs. Industrial automation applications often need to transmit time- and safety-critical data to closely monitor and control industrial processes. Several Ethernet-based fieldbus solutions, such as PROFINET IRT, EtherNet/IP, and EtherCAT, are widely used to ensure real-time communications in industrial automation systems. These solutions, however, commonly incorporate additional mechanisms to provide latency guarantees, making their interoperability a grand challenge. The IEEE 802.1 Time Sensitive Networking (TSN) task group was formed to enhance and optimize IEEE 802.1 network standards, particularly for Ethernet-based networks. These solutions can be evolved and adapted for cross-industry scenarios, such as large-scale distributed industrial plants requiring multiple industrial entities to work collaboratively. This paper provides a comprehensive review of current advances in TSN standards for industrial automation. It presents the state-of-the-art IEEE TSN standards and discusses the opportunities and challenges of integrating TSN into the automation industry. Some promising research directions are also highlighted for applying TSN technologies to industrial automation applications.

en cs.NI
arXiv Open Access 2023
Human-Centered Programming: The Design of a Robotic Process Automation Language

Piotr Gago, Anna Voitenkova, Daniel Jabłonski et al.

RPA (Robotic Process Automation) helps automate repetitive tasks performed by users, often across different software solutions. Regardless of the RPA tool chosen, the key problem in automation is analyzing the steps of these tasks. This is usually done by an analyst with the possible participation of the person responsible for the given activity. However, currently there exists no one-size-fits-all description language, which would allow to record, process, and easily automate steps of specific tasks. Every RPA solution uses a different notation, which is not easily human-readable, editable, and which cannot be applied to a different automation platform. Therefore, in this paper, we propose a new eXtensible Robotic Language (XRL) that can be understood by both programmers and non-programmers to automate repetitive business processes.

en cs.RO, cs.PL

Halaman 12 dari 42523