Adaptive Perturbation Selection Method Reduces Hallucinations in Audio Language Models

A new arXiv paper (ID 2607.00247) presents an adaptive perturbation selection method aimed at reducing hallucinations in audio language models. Audio language models often generate text inconsistent with the input, known as hallucination. The method dynamically selects the most effective perturbation samples from a candidate pool to enhance model robustness against noise during training and inference. Experimental results show substantial reductions in hallucination rates across multiple benchmarks while maintaining generation quality. This work offers a new path toward building more reliable audio AI systems. In the future, such techniques could be widely applied in voice assistants and automatic transcription services.