
    A Pre-training Based Personalized Dialogue Generation Model with Persona-sparse Data

May 06, 2025 · 1 min read

    • year : 2019
    • authors : Yinhe Zheng, Rongsheng Zhang, Xiaoxi Mao, Minlie Huang
    • repository : arXiv
    • proceedings :
    • journal :
    • volume :
    • issue :
    • publisher :
    • doi : 10.48550/arXiv.1911.04700
    • Abstract : Endowing dialogue systems with personas is essential to deliver more human-like conversations. However, this problem is still far from well explored due to the difficulties of both embodying personalities in natural languages and the persona sparsity issue observed in most dialogue corpora. This paper proposes a pre-training based personalized dialogue model that can generate coherent responses using persona-sparse dialogue data. In this method, a pre-trained language model is used to initialize an encoder and decoder, and personal attribute embeddings are devised to model richer dialogue contexts by encoding speakers' personas together with dialogue histories. Further, to incorporate the target persona in the decoding process and to balance its contribution, an attention routing structure is devised in the decoder to merge features extracted from the target persona and dialogue contexts using dynamically predicted weights. Our model can utilize persona-sparse dialogues in a unified manner during the training process, and can also control the amount of persona-related features to exhibit during the inference process. Both automatic and manual evaluation demonstrates that the proposed model outperforms state-of-the-art methods for generating more coherent and persona consistent responses with persona-sparse data.
    • research
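The abstract's key mechanism — an attention-routing structure that merges features from the target persona and the dialogue context using a dynamically predicted weight — can be sketched as follows. This is an illustrative simplification under my own assumptions, not the paper's implementation: the single-head dot-product attention, the sigmoid gate on the decoder query, and all function names here are hypothetical.

```python
import numpy as np

def softmax(x, axis=-1):
    # numerically stable softmax
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def attend(query, keys, values):
    # scaled dot-product attention over a single feature source
    scores = keys @ query / np.sqrt(query.shape[-1])
    return softmax(scores) @ values

def attention_routing(query, persona_kv, context_kv, gate_w, gate_b):
    """Merge persona- and context-attention features with a predicted weight.

    Hypothetical sketch: two attention 'routes' (target persona vs. dialogue
    context) are computed separately, then mixed by a scalar gate alpha that
    is predicted from the decoder query. At inference, alpha can be scaled to
    control how much persona-related information is exhibited.
    """
    persona_feat = attend(query, *persona_kv)
    context_feat = attend(query, *context_kv)
    # dynamically predicted merge weight in (0, 1), via a sigmoid gate
    alpha = 1.0 / (1.0 + np.exp(-(query @ gate_w + gate_b)))
    return alpha * persona_feat + (1.0 - alpha) * context_feat
```

In the paper's setting the gate would be trained end-to-end, which lets persona-sparse dialogues (where the persona contributes little) and persona-dense ones be handled in a unified way; here the gate is just a fixed linear-sigmoid stand-in.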

    Zheng et al_2019_A Pre-training Based Personalized Dialogue Generation Model with Persona-sparse.pdf

    Notes



    Backlinks

    • Retrain or Refactoring Approaches
