A Modular and Efficient Framework for the Development of Large Language Model-Based Virtual Humans: An Educational Scenario

Giordano, M.; Berardini, D.; Frontoni, E.; Zingaretti, P.; Stacchio, L.

doi:10.1007/978-3-032-11317-7_52

The integration of Large Language Models (LLMs) into virtual human systems opens new avenues for creating interactive, intelligent agents capable of natural and personalized human-computer communication. However, generating and deploying such avatars in real time remains computationally demanding, and existing systems often lack modularity or adaptability. In this paper, we propose an efficient and scalable framework for creating LLM-driven virtual humans that balances performance, responsiveness, and expressiveness. Our architecture combines lightweight dialogue management with multimodal synchronization pipelines to support speech and facial animation. The framework includes an optimization layer that enables on-device deployment without compromising interactivity. We demonstrate the effectiveness of our approach by deploying our system on several lightweight devices, showing improvements in latency and adaptability to user input. This work sets the stage for broader use of intelligent avatars in domains such as education, entertainment, and customer support.

2026-01-01

Publication year: 2026

ISBN: 9783032113160, 9783032113177

Files in this record:

File	Size	Format
CR___LLM_Virtual_Avatar___Giordano___HUARL.pdf (open access; license: publisher's copyright)	1.27 MB	Adobe PDF


Use this identifier to cite or link to this document: https://hdl.handle.net/11393/378891

Note: the data shown in this record have not been validated by the university.

UniMC - Pubblicazioni Aperte Digitali
