arXiv Open Access 2025

IGGA: A Dataset of Industrial Guidelines and Policy Statements for Generative AIs

Junfeng Jiao Saleh Afroogh Kevin Chen David Atkinson Amit Dhurandhar

Abstract

This paper introduces IGGA, a dataset of 160 industry guidelines and policy statements on the use of Generative AIs (GAIs) and Large Language Models (LLMs) in industry and workplace settings, collected from official company websites and trustworthy news sources. The dataset contains 104,565 words and serves as a resource for natural language processing tasks commonly applied in requirements engineering, such as model synthesis, abstraction identification, and document structure assessment. IGGA can also be further annotated to serve as a benchmark for tasks including ambiguity detection, requirements categorization, and the identification of equivalent requirements. The selection followed a methodologically rigorous approach, covering reputable and influential companies that represent a diverse range of global institutions across six continents. The dataset captures perspectives from fourteen industry sectors, including technology, finance, and both public and private institutions, offering a broad view of how GAIs and LLMs are being integrated in industry.

Citation Format

Jiao, J., Afroogh, S., Chen, K., Atkinson, D., Dhurandhar, A. (2025). IGGA: A Dataset of Industrial Guidelines and Policy Statements for Generative AIs. https://arxiv.org/abs/2501.00959

Journal Information
Publication year: 2025
Language: en
Source database: arXiv
Access: Open Access