Empowering Diverse Cultural Representation

We analyze datasets to enhance cultural representation, focusing on non-western narratives and voices, ensuring a balanced and inclusive approach in AI-generated content and research.

A computer screen displaying a coding interface with Python code related to machine learning. The code imports libraries like sklearn and deals with model metrics such as precision and recall. A classification report is shown along with a section titled 'Different meta model trained' listing various models like DT, RF, LR, and XGB. Below, there is code for tuning an XGB model using GridSearchCV.
A computer screen displaying a coding interface with Python code related to machine learning. The code imports libraries like sklearn and deals with model metrics such as precision and recall. A classification report is shown along with a section titled 'Different meta model trained' listing various models like DT, RF, LR, and XGB. Below, there is code for tuning an XGB model using GridSearchCV.
Our Mission
Our Vision

Through innovative NLP techniques, we quantify cultural representation, construct indices for voice deficits, and evaluate AI outputs to promote inclusivity and diversity in technology and research.

Cultural Analysis

Exploring cultural representation through data and NLP techniques.

A computer screen displaying a webpage about ChatGPT, focusing on optimizing language models for dialogue. The webpage has text describing the model and includes the OpenAI logo. The background is green with some purple graphical elements on the side.
A computer screen displaying a webpage about ChatGPT, focusing on optimizing language models for dialogue. The webpage has text describing the model and includes the OpenAI logo. The background is green with some purple graphical elements on the side.
Voice Index

Constructing an index for cultural representation analysis.

A display screen shows information about ChatGPT, a language model for dialogue optimization. The text includes details on how the model is used in conversational contexts. The background is primarily green, with pink and purple graphic lines on the right side. The OpenAI logo is positioned at the top left.
A display screen shows information about ChatGPT, a language model for dialogue optimization. The text includes details on how the model is used in conversational contexts. The background is primarily green, with pink and purple graphic lines on the right side. The OpenAI logo is positioned at the top left.
A woman wearing a headscarf and mask is engaged in an interactive session with two young people. She is pointing at a whiteboard filled with comparative adjectives for characters in a list. The setting appears to be an indoor room with red walls, a tiled floor, and closed wooden doors.
A woman wearing a headscarf and mask is engaged in an interactive session with two young people. She is pointing at a whiteboard filled with comparative adjectives for characters in a list. The setting appears to be an indoor room with red walls, a tiled floor, and closed wooden doors.
A close-up view of computer code or data displayed on a screen. The text appears to be part of a JSON-like structure with key-value pairs, featuring words such as 'protected', 'verified', and 'followers'. The text is in white, while some metadata or index numbers are in green.
A close-up view of computer code or data displayed on a screen. The text appears to be part of a JSON-like structure with key-value pairs, featuring words such as 'protected', 'verified', and 'followers'. The text is in white, while some metadata or index numbers are in green.
Model Evaluation

Assessing multicultural text outputs and their representation accuracy.

Contact Us for Dataset Inquiries

Reach out for collaboration on cultural representation analysis projects.

Detailed map displaying data visualization with blue circular markers representing specific data points across a geographical area labeled with city names such as Lisbon and Evora. Bold, brightly colored numerical statistics appear on the left side with the terms 'Suspeitos' and 'Amostras,' suggesting a context of data tracking or analysis.
Detailed map displaying data visualization with blue circular markers representing specific data points across a geographical area labeled with city names such as Lisbon and Evora. Bold, brightly colored numerical statistics appear on the left side with the terms 'Suspeitos' and 'Amostras,' suggesting a context of data tracking or analysis.