Accepted Papers

Long Papers

  • A Multilingual Dataset for Investigating Stereotypes and Negative Attitudes Towards Migrant Groups in Large Language Models, Danielly Sorato, Carme Colominas Ventura and Diana Zavala-Rojas
  • Is ChatGPT an effective solver of sentiment analysis tasks in Portuguese? A Preliminary Study, Gladson de Araujo, Tiago de Melo and Carlos Maurício S. Figueiredo
  • Simple and Fast Automatic Prosodic Segmentation of Brazilian Portuguese Spontaneous Speech, Giovana Meloni Craveiro, Vinicius Gonçalves Santos, Gabriel Jose Pellisser Dalalana, Flaviane R. Fernandes Svartman and Sandra Maria Aluísio
  • LLMs and Translation: different approaches to localization between Brazilian Portuguese and European Portuguese, Eduardo G. Cortes, Ana Luiza Vianna, Mikaela Martins, Sandro Rigo and Rafael Kunst
  • SPARQL can also talk in Portuguese: answering natural language questions with knowledge graphs, Elbe Miranda, Aline Paes and Daniel de Oliveira
  • NLP for historical Portuguese: Analysing 18th-century medical texts, Leonardo Zilio, Rafaela Radünz Lazzari and Maria Jose Bocorny Finatto
  • Automatic Text Readability Assessment in European Portuguese, Eugénio Ribeiro, Nuno Mamede and Jorge Baptista
  • Named entity recognition specialised for Portuguese 18th-century History research, Joaquim Santos, Helena Freire Cameron, Fernanda Olival, Fátima Farrica and Renata Vieira
  • Using Large Language Models for Identifying Satirical News in Brazilian Portuguese, Gabriela Wick-Pedro, Cássio Faria da Silva, Marcio Lima Inácio, Oto Araújo Vale and Helena de Medeiros Caseli
  • Hurdles in Parsing Multi-word Adverbs: Examples from Portuguese, Izabela Muller, Nuno Mamede and Jorge Baptista
  • Portal NURC-SP: Design, Development, and Speech Processing Tools to Support the Public Dissemination of Portuguese Spoken Language, Ana Carolina Rodrigues, Alessandra A. Macedo, Arnaldo Candido Jr, Flaviane R. F. Svartman, Giovana M. Craveiro, Marli Quadros Leite, Sandra M. Aluísio, Vinícius G. Santos and Vinícius M. Garcia
  • TransAlign: An Automated Corpus Generation through Cross-Linguistic Data Alignment for Open Information Extraction, Alan Rios, Bruno Cabral, Daniela Claro, Rerisson Cavalcante and Marlo Souza
  • BATS-PT: Assessing Portuguese Masked Language Models in Lexico-Semantic Analogy Solving and Relation Completion, Hugo Gonçalo Oliveira, Ricardo Rodrigues, Bruno Ferreira, Purificação Silvano and Sara Carvalho
  • A New Benchmark for Automatic Essay Scoring in Portuguese, Igor Cataneo Silveira, André Barbosa and Denis Deratani Mauá
  • Predicting the Age of Emergence of Consonants, Luís Jesus and Jihen Trabelsi
  • Exploring Computational Discernibility of Discourse Domains in Brazilian Portuguese within the Carolina Corpus, Felipe Ribas Serras, Mariana Sturzeneker, Miguel de Mello Carpi, Mayara Feliciano Palma, Maria Clara Ramos Morales Crespo, Aline Silva Costa, Vanessa Martins Do Monte, Cristiane Namiuti, Maria Clara Paixão de Souza and Marcelo Finger
  • A Corpus of Stock Market Tweets Annotated with Named Entities, Michel Monteiro Zerbinati, Norton Trevisan Roman and Ariani Di Felippo
  • Frequency, overlap and origins of palatal sonorants in three Iberian languages, Carlos Silva and Luís Trigo
  • A Named Entity Recognition Approach for Portuguese Legislative Texts Using Self-Learning, Rafael Oleques Nunes, Dennis Giovani Balreira, André Suslik Spritzer and Carla Maria Dal Sasso Freitas
  • Exploring Portuguese Hate Speech Detection in Low-Resource Settings: Lightly Tuning Encoder Models or In-Context Learning of Large Models?, Gabriel Assis, Annie Amorim, Jonnatahn Carvalho, Daniel de Oliveira, Daniela Vianna and Aline Paes
  • Aspect-based sentiment analysis in comments on political debates in Portuguese: evaluating the potential of ChatGPT, Eloize Seno, Lucas Silva, Fábio Anno, Fabiano Rocha and Helena Caseli
  • CLSJUR.BR – A Model for Abstractive Summarization of Legal Documents in Portuguese Language based on Contrastive Learning, Alex Aguiar Lins, Cecilia Silvestre Carvalho, Francisco Das Chagas Jucá Bomfim, Daniel de Carvalho Bentes and Vládia Pinheiro
  • Investigating the Generalizability of Portuguese Readability Assessment Models Trained Using Linguistic Complexity Features, Soroosh Akef, Amália Mendes, Detmar Meurers and Patrick Rebuschat
  • Across the Atlantic: Distinguishing Between European and Brazilian Portuguese Dialects, David Preda, Tomás Osório and Henrique Lopes Cardoso
  • Accent Classification is Challenging but Pre-training Helps: a case study with novel Brazilian Portuguese datasets, Ariadne Matos, Gustavo Araújo, Arnaldo Candido Junior and Moacir Ponti
  • RoBERTaLexPT: A Legal RoBERTa Model pretrained with deduplication for Portuguese, Eduardo A. S. Garcia, Nadia F. F. Silva, Felipe Siqueira, Hidelberg O. Albuquerque, Juliana R. S. Gomes, Ellen Souza and Eliomar A. Lima
  • Evaluating Pre-training Strategies for Literary Named Entity Recognition in Portuguese, Mariana O. Silva and Mirella M. Moro
  • Towards Portparser – a highly accurate parsing system for Brazilian Portuguese following the Universal Dependencies framework, Lucelene Lopes and Thiago Pardo
  • Study of the State of the Art Galician Machine Translation: English-Galician and Spanish-Galician models, Sofía García González and German Rigau Claramunt
  • Bartoli’s areal norms revisited: an agent-based modeling approach, Dalmo Buzato and Evandro Cunha
  • RePro: a benchmark for Opinion Mining for Brazilian Portuguese, Lucas Nildaimon dos Santos Silva, Livy Real, Ana Claudia Bianchini Zandavalle, Carolina Francisco Gadelha Rodrigues, Tatiana da Silva Gama, Fernando Guedes Souza and Phillipe Derwich Silva Zaidan
  • GlórIA: A Generative and Open Large Language Model for Portuguese, Ricardo Lopes, Joao Magalhaes and David Semedo
  • Question Answering for Dialogue State Tracking in Portuguese, Francisco Pais, Patricia Ferreira, Catarina Silva, Ana Alves and Hugo Gonçalo Oliveira
  • Toxic Content Detection in online social networks: a new dataset from Brazilian Reddit Communities, Luiz Henrique Quevedo Lima, Adriana Silvina Pagano and Ana Paula Couto da Silva
  • From Random to Informed Data Selection: A Diversity-Based Approach to Optimize Human Annotation and Few-Shot Learning, Alexandre Alcoforado, Lucas Hideki Takeuchi Okamura, Israel Campos Fama, Bárbara Fernandes Dias Bueno, Arnold Moya Lavado, Thomas Palmeira Ferraz, Bruno Veloso and Anna Helena Reali Costa
  • A Galician Corpus for Misogyny Detection Online, Lucía M. Álvarez-Crespo and Laura M. Castro
  • Exploring Pre-Trained Transformers for Translating Portuguese Text to Brazilian Sign Language, Jose Mario De Martino and Dener Stassun Christinele
  • Text Summarization and Temporal Learning Models Applied to Portuguese Fake News Detection in a Novel Brazilian Corpus Dataset, Gabriel Lino Garcia, Pedro Henrique Paiola, Danilo Samuel Jodas, Luis Afonso Sugi and João Paulo Papa
  • Toxic Speech Detection in Portuguese: A Comparative Study of Large Language Models, Amanda da Silva Oliveira, Thiago de Carvalho Cecote, João Paulo Reis Alvarenga, Vander Luis de Souza Freitas and Eduardo José da Silva Luz
  • Exploring Open Information Extraction for Portuguese Using Large Language Models, Bruno Cabral, Daniela Claro and Marlo Souza
  • Bringing Pragmatics to Porttinari – Adding Speech Acts to News Texts, Nataly L. Patti da Silva, Norton Trevisan Roman and Ariani Di Felippo
  • Authorship Attribution with Rejection Capability in Challenging Contexts of Limited Datasets, Pedro Oliveira and Joaquim Silva
  • Semantic Permanence in Audiovisual Translation: a FrameNet approach to subtitling, Mairon Samagaio, Tiago Torrent, Ely Matos and Arthur Almeida
  • Towards the automatic creation of NER systems for new domains, Emanuel Matos, Mário Rodrigues and António Teixeira
  • Applying event classification to reveal the Estado da Índia, Gonçalo C. Albuquerque, Marlo Souza, Renata Vieira and Ana Sofia Ribeiro
  • Identification of Types of Event-Time Temporal Relations in Portuguese Using a Rule-Based Approach, Dárcio S. Rocha, Marlo Souza and Daniela B. Claro
  • UlyssesNERQ: Expanding Queries from Brazilian Portuguese Legislative Documents through Named Entity Recognition, Hidelberg Albuquerque, Ellen Souza, Tainan Silva, Rafael P. Gouveia, Flavio Junior, Douglas Vitório, Nádia F. F. da Silva, André C.P.L.F. de Carvalho, Adriano L.I. Oliveira and Francisco Edmundo de Andrade
  • Brazilian Portuguese Product Reviews Moderation with AutoML, Lucas Nildaimon dos Santos Silva, Livy Real, Fernando Rezende Zagatti, Ana Claudia Bianchini Zandavalle, Tatiana da Silva Gama and Carolina Francisco Gadelha Rodrigues
  • Evaluating large language models for the tasks of PoS tagging within the Universal Dependency framework, Mateus Machado and Evandro Ruiz
  • A Natural Language Text to Role-Playing Game Animation Generator, Caio F. Oliveira, Artur Franco, Wellington Franco and José G. R. Maia
  • Enhancing Stance Detection in Low-Resource Brazilian Portuguese Using Corpus Expansion generated by GPT-3.5, Dyonnatan Maia and Nádia Félix Felipe da Silva

Short Papers

  • A Bag-of-Users approach to mental health prediction from social media data, Rafael Oliveira and Ivandré Paraboni
  • Semi-automatic corpus expansion: the case of stance prediction, Camila Pereira and Ivandré Paraboni
  • Sequence-to-sequence and transformer approaches to Portuguese text style transfer, Pablo Costa and Ivandré Paraboni
  • Comparative Analysis of Intentional Gramatical Error Correction Techniques on Twitter/X, Thainá Marini and Taffarel Brant-Ribeiro
  • Towards a Syntactic Lexicon of Brazilian Portuguese Adjectives, Ryan Martinez, Jorge Baptista and Oto Vale
  • Literary similarity of novels in Portuguese, Diana Santos
  • An evaluation of Portuguese language models’ adaptation to African Portuguese varieties, Diego Fernando Válio Antunes Alves
  • Text Readability Assessment in European Portuguese: A Comparison of Classification and Regression Approaches, Eugénio Ribeiro, Nuno Mamede and Jorge Baptista
  • Is it safe to machine translate suicide-related language from English to Galician?, John E. Ortega and Annika Marie Schoene
  • Exploring Multimodal Models for Humor Recognition in Portuguese, Marcio Inácio and Hugo Gonçalo Oliveira
  • RecognaSumm: A Novel Brazilian Summarization Dataset, Pedro Henrique Paiola, Gabriel Lino Garcia, Danilo Samuel Jodas, João Vitor Mariano Correia, Luis Afonso Sugi and João Paulo Papa
  • A Speech-Driven Talking Head based on a Two-Stage Generative Architecture, Brayan Bernardo and Paula Costa
  • Increasing manually annotated resources for Galician: the Parallel Universal Dependencies Treebank, Xulia Sánchez-Rodríguez, Albina Sarymsakova, Laura Castro and Marcos Garcia
  • CorpusNÓS: A massive Galician corpus for training large language models, Iria de-Dios-Flores, Silvia Paniagua Suárez, Cristina Carbajal Pérez, Daniel Bardanca Outeiriño, Marcos Garcia and Pablo Gamallo
  • Exploring the effects of vocabulary size in neural machine translation: Galician as a target language, Daniel Bardanca Outeirinho, Pablo Gamallo Otero, Iria de-Dios-Flores and José Ramom Pichel Campos
  • A Reproducibility Analysis of Portuguese Computational Processing Conferences: A Case of Study, Daniel Leal, Anthony Luz and Rafael Anchiêta
  • Natural Language Processing Application in Legislative Activity: a Case Study of Similar Amendments in the Brazilian Senate, Diany Pressato, Pedro Lucas Castro de Andrade, Flávio Rocha Junior, Felipe Alves Siqueira, Ellen Polliana Ramos Souza, Nádia Félix Felipe da Silva, Márcio de Souza Dias and André Carlos Ponce de Leon Ferreira de Carvalho
  • Spatial Information Challenges in English to Portuguese Machine Translation, Rafael Fernandes, Rodrigo Souza, Marcos Lopes, Paulo Santos and Thomas Finbow
  • First assessment of Graph Machine Learning approaches to Portuguese Named Entity Recognition, Gabriel Silva, Mário Rodrigues, António Teixeira and Marlene Amorim
  • Automated admissibility of complaints about fraud and corruption, Thiago De Paula, André Do Amaral, Andre Victor, Luis Alberto Sales, Rodrigo Moreira, Thiago Meirelles and Rafael Basso
  • Compilation and tagging of a corpus with Celpe-Bras texts, Juliana Schoffen, Elisa Stumpf, Deise Amaral, Luiza Divino, Isadora Hanauer, Isabel Lisboa, Amanda Raupp and Brenda Xavier