DiaBiz.Kom

DiaBiz.Kom – the corpus of dialogue texts in Polish. The sample is published under a CC BY-NC-ND 4.0 license. It is part of the larger corpus created within the CLARIN-Biz project. The basis for the dialogues are texts which come from the DiaBiz corpus. The corpus contains transcriptions of telephone conversations conducted according to a prepared scenario. The transcripts of conversations have been manually annotated with a layer of information concerning communicative functions as well as functional and feedback relations. DiaBiz.Kom is the first corpus of this type prepared for the Polish language and will be used to develop a system of dialogue analysis and modules for creating advanced chatbots (Oleksy et al., 2022; Hwaszcz et al., 2023)

Dialogue Material

Dialogue 781168
  • Annotation DiAML-TabSW  -  gold standard:   ⭳  
Dialogue 781752
  • Annotation DiAML-TabSW  -  gold standard:   ⭳  
Dialogue 781978
  • Annotation DiAML-TabSW  -  gold standard:   ⭳  
Dialogue 782016
  • Annotation DiAML-TabSW  -  gold standard:   ⭳  
Dialogue 782044
  • Annotation DiAML-TabSW  -  gold standard:   ⭳  
Dialogue 786296
  • Annotation DiAML-TabSW  -  gold standard:   ⭳  
Dialogue 786364
  • Annotation DiAML-TabSW  -  gold standard:   ⭳