Outlet Title
2024 IEEE International Conference on Machine Learning and Applications (ICMLA-24)
Document Type
Conference Proceeding
Publication Date
Winter 12-2024
Abstract
This study explores the use of LLMs for providing quantitative zero-shot sentiment analysis of implicit software desirability, addressing a critical challenge in product evaluation where traditional review scores, though convenient, fail to capture the richness of qualitative user feedback. Innovations include establishing a method that 1) works with qualitative user experience data without the need for explicit review scores, 2) focuses on implicit user satisfaction, and 3) provides scaled numerical sentiment analysis, offering a more nuanced understanding of user sentiment, instead of simply classifying sentiment as positive, neutral, or negative.
Data is collected using the Microsoft Product Desirability Toolkit (PDT), a well-known qualitative user experience analysis tool. For initial exploration, the PDT metric was given to users of two software systems. PDT data was fed through several LLMs (Claude Sonnet 3 and 3.5, GPT4, and GPT4o) and through a leading transfer learning technique, Twitter-Roberta-Base-Sentiment, and Vader, a leading sentiment analysis tool. Each system was asked to evaluate the data in two ways, by looking at the sentiment expressed in the PDT word/explanation pairs; and by looking at the sentiment expressed by the users in their grouped selection of five words and explanations, as a whole. Numerical analysis is used to provide insights into the magnitude of sentiment to drive high quality decisions regarding product desirability. Each LLM is asked to provide its confidence (low, medium, high) in its sentiment score, along with an explanation of its score.
All LLMs tested were able to statistically detect user sentiment from the users' grouped data, whereas TRBS and Vader were not. The confidence and explanation of confidence provided by the LLMs assisted in understanding user sentiment. This study adds deeper understanding of evaluating user experiences, toward the goal of creating a universal tool that quantifies implicit sentiment.
Recommended Citation
Weitl-Harms, Sherri; Hastings, John D.; and Lum, Jonah, "Using LLMs to Establish Implicit User Sentiment of Software Desirability" (2024). Research & Publications. 84.
https://scholar.dsu.edu/ccspapers/84
Included in
Artificial Intelligence and Robotics Commons, Databases and Information Systems Commons, Graphics and Human Computer Interfaces Commons, Marketing Commons, Technology and Innovation Commons