Reproducible, Explainable, and Effective Evaluations of Agentic AI for Software Engineering
Jingyue Li, André Storhaug
With the advancement of Agentic AI, researchers are increasingly leveraging autonomous agents to address challenges in software engineering (SE). However, the large language models (LLMs) that underpin these agents often function as black boxes, making it difficult to justify the superiority of Agentic AI approaches over baselines. Furthermore, missing information in the evaluation design description frequently renders the reproduction of results infeasible. To synthesize current evaluation practices for Agentic AI in SE, this study analyzes 18 papers on the topic, published or accepted by ICSE 2026, ICSE 2025, FSE 2025, ASE 2025, and ISSTA 2025. The analysis identifies prevailing approaches and their limitations in evaluating Agentic AI for SE, both in current research and potential future studies. To address these shortcomings, this position paper proposes a set of guidelines and recommendations designed to empower reproducible, explainable, and effective evaluations of Agentic AI in software engineering. In particular, we recommend that Agentic AI researchers make their Thought-Action-Result (TAR) trajectories and LLM interaction data, or summarized versions of these artifacts, publicly accessible. Doing so will enable subsequent studies to more effectively analyze the strengths and weaknesses of different Agentic AI approaches. To demonstrate the feasibility of such comparisons, we present a proof-of-concept case study that illustrates how TAR trajectories can support systematic analysis across approaches.
Identifying Ethical Challenges in XR Implementations in the Industrial Domain: A Case of Off-Highway Machinery
Anastasia Sergeeva, Claudia Negri-Ribalta, Gabriele Lenzini
Although extended reality(XR)-using technologies have started to be discussed in the industrial setting, it is becoming important to understand how to implement them ethically and privacy-preservingly. In our paper, we summarise our experience of developing XR implementations for the off-highway machinery domain by pointing to the main challenges we identified during the work. We believe that our findings can be a starting point for further discussion and future research regarding privacy and ethical challenges in industrial applications of XR.
Simulation and Experimental Analysis of L-Section in Reinforced Cement Concrete: Uncertainties in Performance and Strength
Balakrishna Srinivas Maddodi, Nithesh Naik, Prasanna Kumar Maddikeri
et al.
The design and construction of reinforced cement concrete (RCC) flooring play a crucial role in the overall stability of a structure, particularly in regions prone to tectonic activity. RCC floors comprise various beams, including intermediate T-sections and specific L-sections at critical points such as corners and around staircases or lift openings. This paper identifies a key challenge in building frameworks to resist tectonic loads. It further explores the components of the structure that provide potential for interruption, capability, and the safe transfer of tectonic loading to the array connection, all while maintaining sufficient strength. The L-sections were experimented on using various grades of concrete and sizes to reinforce connections under diverse loading conditions. L-sections contribute to reducing floor height, solving economic and technical problems, and creating advanced composite connections that integrate the proposed structural system. The analysis was conducted both analytically and experimentally to assess methods to resist earthquake forces based on stiffness, building strength, and elasticity capacity. These approaches have been identified to safeguard buildings during substantial seismic events. The development of the L-section is detailed, highlighting the loading process and the capacity to overcome various structural challenges.
Engineering machinery, tools, and implements
Recent Trends in Azeotropic Mixture Separation: A Comprehensive Review
Christel Cortel, Kristine Oira Flordeliza, Shayne Ruzzel A. Galvez
et al.
In the chemical and process industries, different techniques are implemented to enable the separation of azeotropic mixtures. These separation methods are broadly classified as azeotropic distillation procedures, which employ the use of entrainers, and membrane-based processes, which mainly use semi-permeable membrane materials. This paper seeks to examine the current trends employed in the separation procedures for azeotropic mixtures in industry, particularly the techniques and methods applied in the actual processes. Additionally, this paper also outlines the common issues encountered in the current setups for the separation of azeotropic mixtures. Several studies show that in comparison to conventional separation techniques, the application of alternative distillation methods and advanced membrane-based techniques for the separation of azeotropic mixtures results in better separation efficiency and reduced energy consumption while also maintaining the cost-effectiveness of the overall process. In addition to this, advancements in the available techniques for separation would also improve the viability and ensure the long-term sustainability of the proposed developments, addressing the current gaps in knowledge while ensuring that existing challenges in the procedures, like membrane fouling and limited scalability, are properly addressed. Furthermore, this paper also highlights the research outlook of the processes involved in the separation of azeotropic mixtures.
Engineering machinery, tools, and implements
Fast Firefighting Water Capacity Assessment Using a Streamlined Single-Loop Hybrid Search
Felipe Hernández
The water distribution system firefighting capacity is widely estimated using decades-old methods that are inefficient for the scale of typical models nowadays. This article introduces an updated algorithm that streamlines the estimation by using a single iterative loop that simultaneously solves for hydrant capacity and the hydraulic effects on the network. The method features a hybrid physically based and heuristic local search approach, and a strategy to easily rank the criticality of network elements that might violate service level constraints. Tests on three models of varying size demonstrate the significant accuracy and efficiency benefits of the proposed approach.
Engineering machinery, tools, and implements
Preparation of Rotor Geometry for Setting up Numerical Model of Flow in Refining Ladle
Lukáš Manoch, Ladislav Socha, Jana Sviželová
et al.
Foundry Degassing Units (FDU) are used for refining aluminum alloys. For an ideal refining process using an FDU unit, it is necessary to select several parameters, which are linked to each other. For a rotary impeller, we searched for several parameters, such as its optimal shape, speed in the liquid alloy, and distance from the bottom of the refining ladle, where the aforementioned parameters contribute to the overall wear and life of the rotor and, consequently, of the rotor shaft. The Computational Fluid Dynamics (CFD) method can be used to determine the above-mentioned parameters. This paper describes the particular steps of preparation of rotor geometry for the subsequent setting up of the basic numerical model.
Engineering machinery, tools, and implements
European nanomaterial legislation in the past 20 years - Closing the final gaps.
Maria Bille Nielsen, L. Skjolding, A. Baun
et al.
In 2004, the potential societal implications related to nanotechnology were highlighted in an influential report by the Royal Society and the Royal Academy of Engineering (RS & RAE). It was made clear that legislation is an important tool to tackle the challenges related to nanomaterials and a list of recommendations were put forward. Shortly after, the European Commission also proposed a list of recommendations on how to handle nanomaterial challenges and adopted the so-called "incremental approach", describing that current legislations should be adapted, where relevant, to handle nanomaterials. Now almost 20 years have passed and it seems relevant to take stock and investigate how legislations have been adapted to tackle nano-specific challenges. In this review, we analyze key pieces of European legislations relevant to nanomaterials and assess to what extent these legislations compare with the original recommendations from 2004 by the RS & RAE and the European Commission. We uncover the cross-cutting challenges that remain and provide recommendations on next steps that should be taken to address the risks of nanomaterials. For each recommendation, we assessed whether it was met to a high, medium or low degree by conducting targeted literature searches at Web of Science, screening legislations, guidance documents, databases etc., and applying expert judgement. We found that >90% of the recommendations put forward in 2004 by the RS & RAE and the European Commission have been either met to a high degree (13 out of 29) or met to a medium degree (14 out of 29). This suggests important advancements in the field of nanosafety. At the same time, it is important to address the concerns still left partly or fully unsolved. Such efforts entail e.g. further development of measuring instruments and standardised characterization and risk assessment methods for nanomaterials, application of a uniform nanomaterial definition, maximization of containment of free nanomaterials until hazards assessed/handled and elimination/minimisation of unintentional nanomaterial emission. Furthermore, we recommend prioritising future efforts to ensure enforcement and implementation of existing nano-specific provisions, as well as revision, where needed, of legislations that currently do not account for nanomaterials, such as the Waste Framework Directive.
A Phenomenography Study of STEM Teachers’ Conceptions of Using Three-Dimensional Modeling and Printing (3DMP) in Teaching
Branko Anđić, Eva Ulbrich, T. Dana-Picard
et al.
There is a large amount of research that indicates that the use of 3DMP in STEM education improves students’ knowledge, motivation, and participation in the learning process. Nevertheless, despite the existing attempts to market 3DMP in education, its adoption in schools remains low. A number of studies with teachers in secondary schools and colleges indicate that teachers’ perceptions of 3DMP are one of the key factors for its successful use. However, to our best knowledge, there is no research that examined STEM upper primary school teachers’ perception on 3DMP. Through phenomenographic approach, this study is seeking to address the existing gaps. Four conceptions of 3DMP teaching emerged: (1) 3DMP as tools for classroom modernization; (2) 3DMP technical and software characteristics’ impact on implementation; (3) 3DMP as a tool for learning and improvement in teaching; (4) 3DMP and students’ professional orientation, teachers’ professional development. These four categories are connected by five key aspects of variation: impact on students, impact on teachers, classroom activity management, authenticity, subject-curriculum matters. The results of our study indicate that the mathematics and science teachers have a more sophisticated opinion on 3DMP than teachers of technical education, engineering, and informatics who mostly require additional training when it comes to using 3DMP in STEM education. Comparatively, science and mathematics teachers need support with implementation of software and 3D printers as a technical tool. Considering that this study’s teachers were early adopters of 3DMP, any future research should explore conceptions of experienced users.
A Secure IoT-Cloud Based Remote Health Monitoring for Heart Disease Prediction Using Machine Learning and Deep Learning Techniques
Sibo Prasad Patro, Neelamadhab Padhy
The Internet of Things (IoT) refers to a network of interconnected devices as well as technology that enables objects to communicate with one another and the cloud for modern medical treatment. To analyze and handle remotely collected electronic clinical records, it is important to create a disease prediction model with increased accuracy. An RHMIoT framework is proposed in a secure cloud context using lightweight block encryption and decryption approaches. The accuracy levels of cardiac disease are calculated using machine learning and deep learning methods. The ensemble voting classifier provided the greatest accuracy of 95%.
Engineering machinery, tools, and implements
Ultrasensitive Raman Spectroscopy-Based Virus Detection Using Glycan-Coated Plasmonic Substrates
Ojodomo J. Achadu, Enoch Y. Park
Hepatitis viral infections are the most common cause of hepatitis liver disease, which eventually leads to cancer and fibrosis if not detected early. Therefore, early detection would allow for preventive and therapeutic actions. Here, a surface-enhanced Raman spectroscopy (SERS)-based biosensor was developed using plasmonic molybdenum trioxide quantum dots (MoO<sub>3</sub>-QDs) as the SERS substrates. The nanostructured substrate of MoO<sub>3</sub>-QDs was functionalized with a proteoglycan (syndecan-1) as a novel bioreceptor for the target hepatitis E virus (HEV). The innovative biodetection system achieved a detection limit of 1.05 fg/mL for the tested HEV target (ORF2), indicating superb clinically relevant sensitivity and performance. The designed biosensing system incorporating a glycan motif as a bioreceptor instead of the conventional antibodies or aptamers presents new insights for the ultrasensitive detection of HEV and other infectious viruses.
Engineering machinery, tools, and implements
Internet of Things-Based Fuzzy Logic Controller for Smart Soil Health Monitoring: A Case Study of Semi-Arid Regions of India
Rajan Prasad, Rajinder Tiwari, Adesh Kumar Srivastava
The human population continues to grow, and specific efforts must be made in order to meet foreseeable food demands. In this paper, it is suggested that an IoT-based fuzzy control system be used for smart soil monitoring systems. This study is based on the semi-arid regions of India. A fuzzy classifier is used to categorize the real-time data into three parameters, such as sodium, potassium, and calcium, based on the proposed model, which gets trained from a dataset and then chooses the optimal solution. The real-time data are collected from NPK sensors, which are suitable for sensing the content of nitrogen, phosphorus, and potassium in the territory, which helps in determining the fertility of the soil by facilitating the systematic assessment of the soil condition. With the aid of this system, a farmer would be able to monitor soil health in a real-time environment and also track the growth of their plants. Farmers will be able to enhance productivity while decreasing resource waste with the aid of an IoT-enabled fuzzy system. Experimental data have been collected from Mahoba district, Uttar Pradesh provinces in India, and the results show that the suggested system is a more reliable and precise concept used for precision farming that will certainly enhance the overall production of crops with better quality. These results obtained with the help of the proposed model system have been compared with the existing one with data accuracy that has been improved and well accepted.
Engineering machinery, tools, and implements
Development of a Chemical Sensor Based on Deep Eutectic Solvents and Its Application for Milk Analysis
Anastasiia Shuba, Ekaterina Anokhina, Ruslan Umarkhanov
et al.
Deep eutectic solvents (DESs) have unique physical and chemical properties, such as low vapor pressure, ease of synthesis, stability, and non-toxicity. Although they have found application in areas of research such as organic synthesis, electrochemistry, biocatalysis, and the development of biosensors, their use as sensitive coatings for chemical sensors has not been previously considered. This study examines the fundamental principles of generating sensitive coatings for piezoelectric quartz sensors utilizing hydrophilic deep eutectic solvents (choline + polyalcohols). Thin films from DESs with a melting point above 50 °C, including those in the composite coatings with amorphous silicon oxide, have been studied. The sorption characteristics of the coatings were thoroughly examined via piezoelectric quartz microbalance. It has been demonstrated that the limits of detection and determination of volatile organic compounds in aqueous solutions by films based on DESs exhibit lower limits than other polymer coatings. A novel approach is proposed for processing the kinetic curve of the sorption of volatile substances by films based on DES to improve the reliability and detection of volatile compounds in the gas phase above aqueous solutions. The use of DES-based piezoelectric quarts sensors has been demonstrated for assessing microbiological indicators of milk.
Engineering machinery, tools, and implements
Limit conditions on local failure of nuclear containment steels (Tensile tests for notched plate and notched thick plate specimens)
Hitoshi NAKAMURA, Kensaku ARAI, Masaaki KIKUCHI
A series of tensile tests were conducted for notched plate specimens whose shapes are similar to the pressure shell to clarify the local failure characteristics for the steel nuclear containment. The specimens included the full or partial-width notched plate specimens of 10 mm thickness, and full-width notched thick plate specimens of 34 mm thickness the same as the actual pressure shell. In the full-width notched plate specimens with a small curvature notch, the cracking occurred on the notch bottom, and the crack propagated and finally ruptured. The plate specimens with a large curvature notch failed in shear suddenly without notch bottom cracking. Those failure modes were very unlike notched round-bar specimens, in which failure always starts at the notch section center. In the case of partial-width notch plate specimens, the notch bottom cracking also occurred for a small curvature notch, and the notch section failure happened before the specimen’s rupture for a large curvature notch. The limit strain diagrams were 40% to 50% lower than those of the notched round-bar specimens. The limit failure stress indexes, which correspond to the stress indications for the limit strain diagrams, could be applied to the shear failure of notched plate specimens. The indexes of the notched plate specimens were approximately 10% lower than those of the round-bar specimens. The crack initiation points for small curvature notch specimens were concentrated in the range of stress triaxiality 0.6 and equivalent plastic strain 0.5 to 0.8. In the full-width notched thick plate specimens, notch bottom cracking occurred for all specimens, and the cracking conditions were almost the same as the full and partial notched plate specimens. However, the maximum nominal stress exceeded 2/3 of the design tensile strength, and it was concluded that local failure occurs at more than twice the design pressure.
Mechanical engineering and machinery, Engineering machinery, tools, and implements
Wireless Communication Technologies for Smart Grid Distribution Networks
Juan Carlos Rodriguez, Felipe Grijalva, Marcelo García
et al.
The modernization of the current electric power grid into a smart grid requires the integration of advanced instrumentation, automation, and communication technologies to optimize efficiency, safety, and reliability. In traditional power grids, communication and control tasks are concentrated in substations, limiting their coverage to high-power equipment. As distributed energy resources increase in different sections of the grid, power flow becomes bi-directional. This requires monitoring and control at the Transmission and Distribution (T&D) level, which forms the largest portion of the power grid. To achieve efficient energy flow management and enable consumer participation in demand management, the integration of information and communication technologies (ICTs) is essential. Wireless sensor networks (WSNs) have been identified as a suitable solution for communications within the distribution network. An ongoing challenge, however, is the definition of the best candidates to solve this problem, among the currently available wireless technologies. This paper reviews different wireless communication technologies that provide robustness, reliability, speed, scalability, and cost-effectiveness for monitoring distribution lines. An outline of the architecture for smart grid communications, the definition of sensor network requirements for power line environments, and an overview of specific studies focusing on technology comparisons are the main contributions of this paper. The purpose of this review is to delineate current technologies in order to establish potential future research directions within the field.
Engineering machinery, tools, and implements
Summary of 2nd International Workshop on Requirements Engineering and Testing (RET)
Elizabeth Bjarnason, Mirko Morandini, Markus Borg
et al.
The RET (Requirements Engineering and Testing) workshop series provides a meeting point for researchers and practitioners from the two separate fields of Requirements Engineering (RE) and Testing. The goal is to improve the connection and alignment of these two areas through an exchange of ideas, challenges, practices, experiences and results. The long term aim is to build a community and a body of knowledge within the intersection of RE and Testing, i.e. RET. The 2nd workshop was held in co-location with ICSE 2015 in Florence, Italy. The workshop continued in the same interactive vein as the 1st one and included a keynote, paper presentations with ample time for discussions, and a group exercise. For true impact and relevance this cross-cutting area requires contribution from both RE and Testing, and from both researchers and practitioners. A range of papers were presented from short experience papers to full research papers that cover connections between the two fields. One of the main outputs of the 2nd workshop was a categorization of the presented workshop papers according to an initial definition of the area of RET which identifies the aspects RE, Testing and coordination effect.
Numerical Study of a PVDF-Based Strain Sensor for Damage Detection of an Asphalt Concrete Pavement Subject to Dynamic Loads
Jiayue Shen, Korkut Bekiroglu, Ali Tekeoglu
et al.
This paper studies the performance of Polyvinylidene fluoride (PVDF)–based strain sensor subject to dynamic loads with different load-moving velocities and the strain sensor’s performance for bottom-up crack detection of an asphalt pavement subject to dynamic loads. The core of the strain sensor is a metalized PVDF sensing film packaged with three protection layers. The encapsulated strain sensor adopts an H-shape to optimize the overall performance. Two numerical models are built in this paper and validate that the voltage output of the PVDF-based strain sensor can well capture the peak lateral strain with the propagation of the bottom-up cracks or the variation of a load moving velocity. Additionally, the sensor has better performance when it is in its lateral alignment position.
Engineering machinery, tools, and implements
An In Situ AFM Study of Electrochemical Bismuth Film Deposition on a Glassy Carbon Substrate Electrode Using a Low Concentration of Bismuth Ions
Ahmed Kreta, Samo B. Hočevar
The bismuth film electrode (BiFE), which was first introduced in 2000 for electrochemical stripping analysis, is now widely used in electroanalytical laboratories worldwide. Numerous scientists have been inspired to conduct more research and broaden the understanding of the BiFE’s favourable electroanalytical performance, which is comparable to, or in some cases even exceeds, that of its mercury counterparts for the detection of heavy metal ions and selected organic compounds. Various types of bismuth-based paste electrodes, as well as in-situ and ex-situ prepared BiFE, have been presented in combination with potentiometric and voltammetric (stripping) protocols. Whereas the ex situ prepared electrodes must be moved from the modification solution to the measuring cell device and often need to display improved stability for several measurements, the ex-situ-prepared bismuth films require acceptable physical and chemical stability. In this study, we provided insight into the commencement of the formation of the bismuth film on a glassy carbon substrate electrode (GCE) when using a very low concentration of bismuth ions. We used our home-constructed AFM cell to fit in commercial working GCE, a platinum wire as the counter electrode, and an Ag/AgCl/NaCl (3 M) as the reference electrode.
Engineering machinery, tools, and implements
Numerical computing in engineering mathematics
Firuz Kamalov, Ho-Hon Leung
The rapid advances in technology over the last decade have significantly altered the nature of engineering knowledge and skills required in the modern industries. In response to the changing professional requirements, engineering institutions have updated their curriculum and pedagogical practices. However, most of the changes in the curriculum have been focused on the core engineering courses without much consideration for the auxiliary courses in mathematics and sciences. In this paper, we aim to propose a new, augmented mathematics curriculum aimed at meeting the requirements of the modern, technology-based engineering workplace. The proposed updates require minimal resources and can be seamlessly integrated into the existing curriculum.
Lean Service-inventory Management Integrated Model to Improve the Service Level in a Metalworking Company
F. G. Alfaro, D. V. Jacinto, A. E. Flores
et al.
The importance of the metalworking sector in Peru is reflected in the high demand for machinery, equipment and structures in various economic sectors such as industry, construction, mining, transportation, among others. This makes it a generator of large productive links and employment. Today, the struggle to follow a quality standard that guarantees the production of these goods persists, as well as the effort to improve the customer's perception of the service offered. This study proposes an integrated model to increase the service level of a metalworking company through the use of Lean Service and Inventory Management tools. The results show a 10.13% increase in the service level, thanks to the implementation of engineering concepts such as 5s, Kaizen and Inventory Management.
User Experience Analytics pada Sistem Informasi Politeknik Negeri Ujung Pandang
Nurfaida Hm, Eddy Tungadi, Irfan Syamsuddin
The Ujung Pandang State Polytechnic (PNUP) manages several departments, consisting of the Department of Electrical Engineering, the Department of Machinery, the Department of Civil, the Department of Chemistry, the Department of Accounting, and the Department of Commerce Administration. In managing how many significant programs, PNUP requires a quality information system to support academic activities by evaluating the system's quality that has been implemented. The PNUP Information System with the URL address https://.www.poliupg.ac.id/ has never been evaluated, where there are several complaints from students and admins regarding obstacles to using the information system. Complaints about the information system's use prompted researchers to research the quality of the PNUP information system. Therefore, the User Experience was analysed from the usability aspect, which was tested with several tools, namely Woorank, SEOquake, PageSpeed and using the SUS method. Testing with the usability aspect approach was carried out on the information system at PNUP. The test results using several tools, namely Woorank, got a score of 78. It was considered suitable for facilitating searches on search engines. Page Speed scored 67 for desktop use and 28 for mobile use. It was considered very bad in loading website pages which were considered to disturb user satisfaction and using the SUS method, with 180 respondents getting a score of 58.65, which is considered usable and has the potential to reduce visitors to the PNUP information system. It can be judged that the PNUP information system has not met the usability aspect.