Program at a glance

(subject to change, check the official website)

March 12 (Fontán Building)

  • 09:30-11:00: T1, W1
  • 11:00-11:30: Coffee Break
  • 11:00-13:00: W2
  • 11:30-13:00: T1, W1, ST
  • 13:00-14:30: Lunch Break
  • 14:30-16:00: T2, W1, W2, W3
  • 16:00-16:30: Coffee Break
  • 16:30-18:00: T2, W1, W2, W3

March 13 (CINC Building)

  • 09:00-09:30: Opening
  • 09:30-10:30: Keynote 1, Elias Feijó
  • 10:30-11:00: Coffee Break
  • 11:00-12:30: Session 1, Language Models
  • 12:30-13:00: Brasileiras em PLN
  • 13:00-14:30: Lunch Break
  • 14:30-16:00: Session 2a, Readability & Language Analysis
  • 14:30-16:00: Session 2b, Named Entity Recognition
  • 16:00-16:30: Coffee Break
  • 16:30-17:30: Session 3, Data Annotation & Sign Language
  • 18:30-20:30: Guided Tour (Old Town), "Compostela e as origens da língua"

March 14 (CINC Building)

  • 09:00-10:30: Session 4a, Sentiment Analysis & Hate Speech
  • 09:00-10:30: Session 4b, Open IE & QA
  • 10:30-11:00: Coffee Break
  • 11:00-12:00: Session 5, Language Models & Text Classification
  • 12:00-13:00: Keynote 2, Gemma Boleda
  • 13:00-14:30: Lunch Break
  • 14:30-16:00: Industry Track, Discussion Panel
  • 16:00-16:30: Coffee Break
  • 16:30-17:30: Session 6, Posters & Demos
  • 17:00-17:30: Open Meeting, "Recursos para o Português"
  • 20:30: Dinner & Cultural Events

March 15 (CINC Building)

  • 09:00-10:00: Session 7a, Applications
  • 09:00-10:00: Session 7b, Posters
  • 10:00-11:00: Keynote 3, Marta R. Costa-jussà
  • 11:00-11:30: Coffee Break & Awards
  • 11:30-13:00: Discussion Panel, "A IA e o Futuro da Língua Portuguesa"
  • 13:00-14:30: Lunch Break
  • 14:30-15:00: Closing
  • 15:00-16:30: Community Meeting
  • 18:00-19:00: Guided Visit, CESGA

T1: Tutorial 1 - Automatic measurement of distances between languages using Swadesh lists and big text corpora

T2: Tutorial 2 - Annotating dependency syntax for treebank development

W1: Workshop 1 - 3rd Workshop on Digital Humanities and Natural Language Processing

W2: Workshop 2 - 1st Workshop on NLP for Indigenous Languages of Lusophone Countries

W3: Workshop 3 - 5th OpenCor: Latin American and Iberian Languages Open Corpora Forum

ST: Shared Task - Competition on Automatic Essay Scoring of Portuguese Narrative Essays

March 12 - Workshops & Tutorials

(Fontán Building)

Start

End

Title

08:30

10:00

Registration

Registration Desk

09:30

11:00

Workshops Session 1

Sala 10

Sala 9

Sala 6

Sala 1

Sala 7

09:30

11:00

Tutorial 1:

Automatic measurement of distances between languages using Swadesh lists and big text corpora

3rd Workshop on Digital Humanities and NLP

11:00

11:30

Coffee Break

1st Workshop on NLP for Indigenous Languages of Lusophone Countries

11:30

13:00

Workshops Session 2

Sala 10

Sala 9

Sala 6

Sala 1

11:30

13:00

Tutorial 1:

Automatic measurement of distances between languages using Swadesh lists and big text corpora

3rd Workshop on Digital Humanities and NLP

Competition on Automatic Essay Scoring of Portuguese Narrative Essays Written by Basic School Students

13:00

14:30

Lunch Break

14:30

16:00

Workshops Session 3

Sala 10

Sala 9

Sala 6

Sala 1

Sala 7

14:30

16:00

Tutorial 2:

Annotating dependency syntax for treebank development

3rd Workshop on Digital Humanities and NLP

5th OpenCor: Latin American and Iberian Languages Open Corpora Forum

1st Workshop on NLP for Indigenous Languages of Lusophone Countries

16:00

16:30

Coffee Break

16:30

18:00

Workshops Session 4

Sala 10

Sala 9

Sala 6

Sala 1

Sala 7

16:30

18:00

Tutorial 2:

Annotating dependency syntax for treebank development

3rd Workshop on Digital Humanities and NLP

5th OpenCor: Latin American and Iberian Languages Open Corpora Forum

1st Workshop on NLP for Indigenous Languages of Lusophone Countries

March 13 - Main Conference

(CINC Building)

Start

End

Title

08:30

10:00

Registration

Registration Desk

09:00

09:30

Opening

Sala 1

09:30

10:30

Keynote 1: Elias Feijó

É este galego latim em pó?

Sala 1

10:30

11:00

Coffee Break

Sala de Convencións

11:00

12:30

Session 1: Oral Presentations

Sala 1

11:00

11:20

"Using Large Language Models for Identifying Satirical News in Brazilian Portuguese", Gabriela Wick-Pedro, Cássio Faria da Silva, Marcio Lima Inácio, Oto Araújo Vale and Helena de Medeiros Caseli

11:20

11:40

"Exploring Computational Discernibility of Discourse Domains in Brazilian Portuguese within the Carolina Corpus", Felipe Ribas Serras, Mariana Sturzeneker, Miguel de Mello Carpi, Mayara Feliciano Palma, Maria Clara Ramos Morales Crespo, Aline Silva Costa, Vanessa Martins Do Monte, Cristiane Namiuti, Maria Clara Paixão de Souza and Marcelo Finger

11:40

12:00

"LLMs and Translation: different approaches to localization between Brazilian Portuguese and European Portuguese", Eduardo G. Cortes, Ana Luiza Vianna, Mikaela Martins, Sandro Rigo and Rafael Kunst

12:00

12:20

"Evaluating large language models for the tasks of PoS tagging within the Universal Dependency framework", Mateus Machado and Evandro Ruiz

12:30

13:00

Presentation of the Natural Language Processing book (2nd edition)

Brasileiras em PLN

Sala 1

13:00

14:30

Lunch Break

Sala de Convencións

14:30

16:00

Session 2a: Oral Presentations

Sala 1

Session 2b: Oral Presentations

Sala 2

14:30

14:50

"Automatic Text Readability Assessment in European Portuguese", Eugénio Ribeiro, Nuno Mamede and Jorge Baptista

"Towards the automatic creation of NER systems for new domains", Emanuel Matos, Mário Rodrigues and António Teixeira

14:50

15:10

"Investigating the Generalizability of Portuguese Readability Assessment Models Trained Using Linguistic Complexity Features", Soroosh Akef, Amália Mendes, Detmar Meurers and Patrick Rebuschat

"A Named Entity Recognition Approach for Portuguese Legislative Texts Using Self-Learning", Rafael Oleques Nunes, Dennis Giovani Balreira, André Suslik Spritzer and Carla Maria Dal Sasso Freitas

15:10

15:30

"Hurdles in Parsing Multi-word Adverbs: Examples from Portuguese", Izabela Muller, Nuno Mamede and Jorge Baptista

"UlyssesNERQ: Expanding Queries from Brazilian Portuguese Legislative Documents through Named Entity Recognition", Hidelberg Albuquerque, Ellen Souza, Tainan Silva, Rafael P. Gouveia, Flavio Junior, Douglas Vitório, Nádia F. F. da Silva, André C.P.L.F. de Carvalho, Adriano L.I. Oliveira and Francisco Edmundo de Andrade

15:30

15:50

"Bringing Pragmatics to Porttinari - Adding Speech Acts to News Texts", Nataly L. Patti da Silva, Norton Trevisan Roman and Ariani Di Felippo

"Named entity recognition specialised for Portuguese 18th-century History research", Joaquim Santos, Helena Freire Cameron, Fernanda Olival, Fátima Farrica and Renata Vieira

16:00

16:30

Coffee Break

Sala de Convencións

16:30

17:30

Session 3: Oral Presentations

Sala 1

16:30

16:50

"A Corpus of Stock Market Tweets Annotated with Named Entities", Michel Monteiro Zerbinati, Norton Trevisan Roman and Ariani Di Felippo

16:50

17:10

"From Random to Informed Data Selection: A Diversity-Based Approach to Optimize Human Annotation and Few-Shot Learning", Alexandre Alcoforado, Lucas Hideki Takeuchi Okamura, Israel Campos Fama, Bárbara Fernandes Dias Bueno, Arnold Moya Lavado, Thomas Palmeira Ferraz, Bruno Veloso and Anna Helena Reali Costa

17:10

17:30

"Exploring Pre-Trained Transformers for Translating Portuguese Text to Brazilian Sign Language", Jose Mario De Martino and Dener Stassun Christinele

18:30

20:30

Guided Tour "Compostela e as Origens da Língua" through the Old Town of Santiago de Compostela

Bus departs from the Cidade da Cultura (at the end of Session 3)

Tour starts at: Praza da Pescadaría Vella

March 14 - Main Conference

(CINC Building)

Start

End

Title

08:30

10:00

Registration

Registration Desk

09:00

10:30

Session 4a: Oral Presentations

Sala 1

Session 4b: Oral Presentations

Sala 2

09:00

09:20

"Is ChatGPT an effective solver of sentiment analysis tasks in Portuguese? A Preliminary Study", Gladson de Araujo, Tiago de Melo and Carlos Maurício S. Figueiredo

"SPARQL can also talk in Portuguese: answering natural language questions with knowledge graphs", Elbe Miranda, Aline Paes and Daniel de Oliveira

09:20

09:40

"A Galician Corpus for Misogyny Detection Online", Lucía M. Álvarez-Crespo and Laura M. Castro

"Question Answering for Dialogue State Tracking in Portuguese", Francisco Pais, Patricia Ferreira, Catarina Silva, Ana Alves and Hugo Gonçalo Oliveira

09:40

10:00

"Exploring Portuguese Hate Speech Detection in Low-Resource Settings: Lightly Tuning Encoder Models or In-Context Learning of Large Models?", Gabriel Assis, Annie Amorim, Jonnathan Carvalho, Daniel de Oliveira, Daniela Vianna and Aline Paes

"Identification of Types of Event-Time Temporal Relations in Portuguese Using a Rule-Based Approach", Dárcio S. Rocha, Marlo Souza and Daniela B. Claro

10:00

10:20

"Toxic Content Detection in online social networks: a new dataset from Brazilian Reddit Communities", Luiz Henrique Quevedo Lima, Adriana Silvina Pagano and Ana Paula Couto da Silva

"TransAlign: An Automated Corpus Generation through Cross-Linguistic Data Alignment for Open Information Extraction", Alan Rios, Bruno Cabral, Daniela Claro, Rerisson Cavalcante and Marlo Souza

10:30

11:00

Coffee Break

Sala de Convencións

11:00

12:00

Session 5: Oral Presentations

Sala 1

11:00

11:20

"Authorship Attribution with Rejection Capability in Challenging Contexts of Limited Datasets", Pedro Oliveira and Joaquim Silva

11:20

11:40

"GlórIA: A Generative and Open Large Language Model for Portuguese", Ricardo Lopes, João Magalhães and David Semedo

11:40

12:00

"Evaluating Pre-training Strategies for Literary Named Entity Recognition in Portuguese", Mariana O. Silva and Mirella M. Moro

12:00

13:00

Keynote 2: Gemma Boleda

Pressures on the lexicon and their effects

Sala 1

13:00

14:30

Lunch Break

Sala de Convencións

14:30

16:00

Industry Track

Sala 1

14:30

16:00

Discussion Panel: "Challenges and opportunities for the Portuguese/Galician-speaking language industry in the age of LLMs"

Senén Barro Ameneiro (CiTIUS, Galiza), Livy Real (Quinto Andar, Brazil), Diego Vázquez (imaxin|software, Galiza), Carlos Amaral (Priberam, Portugal), Paulo Cavalin (IBM Research, Brazil), José Ramom Pichel (Nós Project, Galiza).

16:00

16:30

Coffee Break

Posters & Demos Start

Sala de Convencións

16:30

17:30

Session 6: Posters & Demos (see titles below)

Sala de Convencións

17:00

17:30

Open meeting on sharing and making available computational resources for Portuguese processing

Sala 1

20:30

-

Dinner and Cultural Events

(A Nave de Vidán)

Session 6 Posters

  • "Sequence-to-sequence and transformer approaches to Portuguese text style transfer", Pablo Costa and Ivandré Paraboni.
  • "Simple and Fast Automatic Prosodic Segmentation of Brazilian Portuguese Spontaneous Speech", Giovana Meloni Craveiro, Vinicius Gonçalves Santos, Gabriel Jose Pellisser Dalalana, Flaviane R. Fernandes Svartman and Sandra Maria Aluísio.
  • "Text Summarization and Temporal Learning Models Applied to Portuguese Fake News Detection in a Novel Brazilian Corpus Dataset", Gabriel Lino Garcia, Pedro Henrique Paiola, Danilo Samuel Jodas, Luis Afonso Sugi and João Paulo Papa.
  • "An evaluation of Portuguese language models' adaptation to African Portuguese varieties", Diego Fernando Válio Antunes Alves.
  • "Text Readability Assessment in European Portuguese: A Comparison of Classification and Regression Approaches", Eugénio Ribeiro, Nuno Mamede and Jorge Baptista.
  • "Is it safe to machine translate suicide-related language from English to Galician?", John E. Ortega and Annika Marie Schoene.
  • "Semantic Permanence in Audiovisual Translation: a FrameNet approach to subtitling", Mairon Samagaio, Tiago Torrent, Ely Matos and Arthur Almeida.
  • "Portal NURC-SP: Design, Development, and Speech Processing Tools to Support the Public Dissemination of Portuguese Spoken Language", Ana Carolina Rodrigues, Alessandra A. Macedo, Arnaldo Candido Jr, Flaviane R. F. Svartman, Giovana M. Craveiro, Marli Quadros Leite, Sandra M. Aluísio, Vinícius G. Santos and Vinícius M. Garcia.
  • "Exploring Multimodal Models for Humor Recognition in Portuguese", Marcio Inácio and Hugo Gonçalo Oliveira.
  • "BATS-PT: Assessing Portuguese Masked Language Models in Lexico-Semantic Analogy Solving and Relation Completion", Hugo Gonçalo Oliveira, Ricardo Rodrigues, Bruno Ferreira, Purificação Silvano and Sara Carvalho.
  • "A New Benchmark for Automatic Essay Scoring in Portuguese", Igor Cataneo Silveira, André Barbosa and Denis Deratani Mauá.
  • "Predicting the Age of Emergence of Consonants", Luís Jesus and Jihen Trabelsi.
  • "Frequency, overlap and origins of palatal sonorants in three Iberian languages", Carlos Silva and Luís Trigo.
  • "A Speech-Driven Talking Head based on a Two-Stage Generative Architecture", Brayan Bernardo and Paula Costa.
  • "Exploring the effects of vocabulary size in neural machine translation: Galician as a target language", Daniel Bardanca Outeirinho, Pablo Gamallo Otero, Iria de-Dios-Flores and José Ramom Pichel Campos.
  • "Across the Atlantic: Distinguishing Between European and Brazilian Portuguese Dialects", David Preda, Tomás Osório and Henrique Lopes Cardoso.
  • "Accent Classification is Challenging but Pre-training Helps: a case study with novel Brazilian Portuguese datasets", Ariadne Matos, Gustavo Araújo, Arnaldo Candido Junior and Moacir Ponti.
  • "RoBERTaLexPT: A Legal RoBERTa Model pretrained with deduplication for Portuguese", Eduardo A. S. Garcia, Nadia F. F. Silva, Felipe Siqueira, Hidelberg O. Albuquerque, Juliana R. S. Gomes, Ellen Souza and Eliomar A. Lima.
  • "A Reproducibility Analysis of Portuguese Computational Processing Conferences: A Case of Study", Daniel Leal, Anthony Luz and Rafael Anchiêta.
  • "Brazilian Portuguese Product Reviews Moderation with AutoML", Lucas Nildaimon dos Santos Silva, Livy Real, Fernando Rezende Zagatti, Ana Claudia Bianchini Zandavalle, Tatiana da Silva Gama and Carolina Francisco Gadelha Rodrigues.
  • "Natural Language Processing Application in Legislative Activity: a Case Study of Similar Amendments in the Brazilian Senate", Diany Pressato, Pedro Lucas Castro de Andrade, Flávio Rocha Junior, Felipe Alves Siqueira, Ellen Polliana Ramos Souza, Nádia Félix Felipe da Silva, Márcio de Souza Dias and André Carlos Ponce de Leon Ferreira de Carvalho.
  • "Spatial Information Challenges in English to Portuguese Machine Translation", Rafael Fernandes, Rodrigo Souza, Marcos Lopes, Paulo Santos and Thomas Finbow.

Session 6 System Demonstrations

  • "GiDi: A Virtual Assistant for Screening Protocols at Home", Andrés Piñeiro-Martín, Carmen García-Mateo, Laura Docío-Fernández, María del Carmen López-Pérez and Ignacio Novo-Veleiro.
  • "FazGame: A Game Based Platform that Uses Artificial Intelligence to Help Students to Improve Brazilian Portuguese Writing Skills", Jéssica Soares Dos Santos, Gabriel Coelho, Sidney Melo, Oniram Atila and Carla Zeltzer.
  • "Indexing Portuguese NLP Resources with PT-Pump-Up", Rúben Almeida, Ricardo Campos, Alípio Jorge and Sérgio Nunes.
  • "plain X – AI Supported Multilingual Video Workflow Platform", Carlos Amaral, Catarina Lagrifa, Mirko Lorenz, Peggy van der Kreeft and Tiago Veiga.
  • "Perfil Público: Automatic Generation and Visualization of Author Profiles for Digital News Media", Nuno Guimarães, Ricardo Campos and Alípio Jorge.
  • "Exploring Open Information Extraction for the Portuguese language: An integrated monolithic approach in Cloud environment", Augusto Barreto and Daniela Claro.
  • "Blip Copilot: a smart conversational assistant", Evandro Fonseca, Tayane Soares, Dyovana Baptista, Rogers Damas and Lucas Avanco.
  • "Galician–Portuguese Neural Machine Translation System", Sofía García González.
  • "Nós-TTS: a Web User Interface for Galician Text-to-Speech", Carmen Magariños, Alp Öktem, Antonio Moscoso Sánchez, Marta Vázquez Abuín, Noelia García Díaz, Adina Ioana Vladu, Elisa Fernández Rei and María Baqueiro Vidal.
  • "Autopilot: a smart sales assistant", Amanda Oliveira, João Alvarenga, Evandro Fonseca and William Colen.

March 15 - Main Conference

(CINC Building)

Start

End

Title

08:30

10:00

Registration

Registration Desk

09:00

10:00

Session 7: Oral & Poster Presentations

Sala 1: Oral Presentations

Sala de Convencións: Poster Presentations

09:00

09:20

"CLSJUR.BR - A Model for Abstractive Summarization of Legal Documents in Portuguese Language based on Contrastive Learning", Alex Aguiar Lins, Cecilia Silvestre Carvalho, Francisco Das Chagas Jucá Bomfim, Daniel de Carvalho Bentes and Vládia Pinheiro

Poster Session 7

(see titles below)

09:20

09:40

"Applying event classification to reveal the Estado da Índia", Gonçalo C. Albuquerque, Marlo Souza, Renata Vieira and Ana Sofia Ribeiro

09:40

10:00

"A Natural Language Text to Role-Playing Game Animation Generator", Caio F. Oliveira, Artur Franco, Wellington Franco and José G. R. Maia

10:00

11:00

Keynote 3: Marta Ruiz Costa-jussà

Beyond Semantic Evaluation in Seamless Speech Translation Models

Sala 1

11:00

11:30

Coffee Break & Awards (11:15)

Sala de Convencións (Coffee Break) & Sala 2 (Awards)

11:30

13:00

Discussion Panel:

"A Inteligência Artificial e o Futuro da Língua Portuguesa"

António H. Branco (Honorary President of ELRA & Professor at the University of Lisbon, Portugal), Valentín García (Secretary General for Language Policy, Xunta de Galicia), Ana Paula Laborinho (Director of OEI, Portugal), Claudio Pinhanez (IBM & Vice-Director C4AI, Brazil), Ismael Gómez García (Director of Global Digital Strategy at OEI, Spain).

Sala 1

13:00

14:30

Lunch Break

Sala de Convencións

14:30

15:00

Closing Session

Sala 1

15:00

16:30

Community Meeting

Sala 1

18:00

19:30

Guided Visit to CESGA:

Galicia Supercomputing Center

Session 7 Posters

  • "A Multilingual Dataset for Investigating Stereotypes and Negative Attitudes Towards Migrant Groups in Large Language Models", Danielly Sorato, Carme Colominas Ventura and Diana Zavala-Rojas.
  • "A Bag-of-Users approach to mental health prediction from social media data", Rafael Oliveira and Ivandré Paraboni.
  • "Semi-automatic corpus expansion: the case of stance prediction", Camila Pereira and Ivandré Paraboni.
  • "Comparative Analysis of Intentional Gramatical Error Correction Techniques on Twitter/X", Thainá Marini and Taffarel Brant-Ribeiro.
  • "Towards a Syntactic Lexicon of Brazilian Portuguese Adjectives", Ryan Martinez, Jorge Baptista and Oto Vale.
  • "Literary similarity of novels in Portuguese", Diana Santos.
  • "NLP for historical Portuguese: Analysing 18th-century medical texts", Leonardo Zilio, Rafaela Radünz Lazzari and Maria Jose Bocorny Finatto.
  • "Exploring Open Information Extraction for Portuguese Using Large Language Models", Bruno Cabral, Daniela Claro and Marlo Souza.
  • "First assessment of Graph Machine Learning approaches to Portuguese Named Entity Recognition", Gabriel Silva, Mário Rodrigues, António Teixeira and Marlene Amorim.
  • "Toxic Speech Detection in Portuguese: A Comparative Study of Large Language Models", Amanda da Silva Oliveira, Thiago de Carvalho Cecote, João Paulo Reis Alvarenga, Vander Luis de Souza Freitas and Eduardo José da Silva Luz.
  • "RecognaSumm: A Novel Brazilian Summarization Dataset", Pedro Henrique Paiola, Gabriel Lino Garcia, Danilo Samuel Jodas, João Vitor Mariano Correia, Luis Afonso Sugi and João Paulo Papa.
  • "Aspect-based sentiment analysis in comments on political debates in Portuguese: evaluating the potential of ChatGPT", Eloize Seno, Lucas Silva, Fábio Anno, Fabiano Rocha and Helena Caseli.
  • "Increasing manually annotated resources for Galician: the Parallel Universal Dependencies Treebank", Xulia Sánchez-Rodríguez, Albina Sarymsakova, Laura Castro and Marcos Garcia.
  • "CorpusNÓS: A massive Galician corpus for training large language models", Iria de-Dios-Flores, Silvia Paniagua Suárez, Cristina Carbajal Pérez, Daniel Bardanca Outeiriño, Marcos Garcia and Pablo Gamallo.
  • "Towards Portparser - a highly accurate parsing system for Brazilian Portuguese following the Universal Dependencies framework", Lucelene Lopes and Thiago Pardo.
  • "Automated admissibility of complaints about fraud and corruption", Thiago De Paula, André Do Amaral, Andre Victor, Luis Alberto Sales, Rodrigo Moreira, Thiago Meirelles and Rafael Basso.
  • "Study of the State of the Art Galician Machine Translation: English-Galician and Spanish-Galician models", Sofía García González and German Rigau Claramunt.
  • "Bartoli's areal norms revisited: an agent-based modeling approach", Dalmo Buzato and Evandro Cunha.
  • "RePro: a benchmark for Opinion Mining for Brazilian Portuguese", Lucas Nildaimon dos Santos Silva, Livy Real, Ana Claudia Bianchini Zandavalle, Carolina Francisco Gadelha Rodrigues, Tatiana da Silva Gama, Fernando Guedes Souza and Phillipe Derwich Silva Zaidan.
  • "Compilation and tagging of a corpus with Celpe-Bras texts", Juliana Schoffen, Elisa Stumpf, Deise Amaral, Luiza Divino, Isadora Hanauer, Isabel Lisboa, Amanda Raupp and Brenda Xavier.
  • "Enhancing Stance Detection in Low-Resource Brazilian Portuguese Using Corpus Expansion generated by GPT-3.5", Dyonnatan Maia and Nádia Félix Felipe da Silva.
  • "TTS applied to the generation of datasets for automatic speech recognition", Edresson Casanova, Sandra Aluísio and Moacir Antonelli Ponti.
  • "Text clustering applied to unbalanced data in legal contexts", Lucas José Gonçalves Freitas.