In this paper, we present a fully Bayesian Beta-Liouville Multinomial mixture model for unsupervised sentiment identification. The Beta-Liouville distribution has recently emerged as a flexible and powerful alternative to the Dirichlet prior in statistical language modeling. While maintaining conjugacy with the Multinomial likelihood, it introduces an additional shape parameter that allows for a more expressive and adaptable covariance structure. For posterior inference, we develop a coordinate ascent variational algorithm that maximizes an analytically tractable lower bound on the marginal likelihood with respect to a set of variational parameters. Crucially, the model is reparameterized so that the Dirichlet-Multinomial mixture appears as a special case of the more general Beta-Liouville formulation, thus providing a unified and versatile probabilistic framework. The proposed methodology is empirically assessed on benchmark datasets involving binary and ternary sentiment classification, as well as irony detection, highlighting its enhanced flexibility in modeling emotional polarity in ultra-short textual data.

Unsupervised sentiment detection with mixtures of Beta-Liouville Multinomial distributions

Bilancia, Massimo
;
Magro, Samuele
2026-01-01

Abstract

In this paper, we present a fully Bayesian Beta-Liouville Multinomial mixture model for unsupervised sentiment identification. The Beta-Liouville distribution has recently emerged as a flexible and powerful alternative to the Dirichlet prior in statistical language modeling. While maintaining conjugacy with the Multinomial likelihood, it introduces an additional shape parameter that allows for a more expressive and adaptable covariance structure. For posterior inference, we develop a coordinate ascent variational algorithm that maximizes an analytically tractable lower bound on the marginal likelihood with respect to a set of variational parameters. Crucially, the model is reparameterized so that the Dirichlet-Multinomial mixture appears as a special case of the more general Beta-Liouville formulation, thus providing a unified and versatile probabilistic framework. The proposed methodology is empirically assessed on benchmark datasets involving binary and ternary sentiment classification, as well as irony detection, highlighting its enhanced flexibility in modeling emotional polarity in ultra-short textual data.
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11369/481533
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact