AI for Software Engineering

Usability Testing with AI

Does Generative AI Make Usability Testing Obsolete? Our new ICSE'25 paper, co-authored with Ali Ebrahimi Pourasad, reports on a study we conducted to answer this question.

Date: 01 Apr 2025

In brief, our results show that hashtag#GPT and Co. can have a fairly good precision to identify usability issues (~61%). However, they cannot (yet?) replace usability testing and expert reviews. Still, we observed that hashtag#GenAI can serve as a ✳️ valuable supplement ✳️ particularly for small teams with limited resources and expertise to identify issues in less common user paths, due to its ability to consider the source code too.

Most credits for this work should go to Ali, who conducted most of the research during his Master Thesis project at the University of Hamburg. It was a pleasure to supervise and mentor him and now to co-engage in a “PhD adventure”. Congratulations Ali! Twice: for the paper and for the award. Super proud of you.

You can download the Preprint from Arxiv. The source code and the research data are also available in the replication package (link in the paper). This work is part of a bigger initiative, where my team and I are studying opportunities and risks of using Foundation Models and GenAI in Software Engineering & Design with a focus on the Human-AI-Teaming.

Please consider joining the talk at the conference in Ottawa, Canada in April 2025.

AI for Software Engineering

GenAI and Critical Thinking

How do programming students use ChatGPT? And what can we learn from this? Our observational study (published at FSE'25) shows that concerns about potential decrease in programmers' agency and productivity with Generative Ai are justified.

Usability Testing with AI

Next

GenAI and Critical Thinking