This repository provides dataset and other materials for the paper entitled:
Poster can be found here.
dataset
folder include evaluation dataset in different formats that include both manual statements auto generated statements of true/false statements.
Note that the following false statements may contain biases, discrimination, and prejudice and true statement may not necessarily reflect the truth.
We select six topics related to UN SDGs. We consider two experimental setups---one with automatically generated text and another with manually crafted text.
For each chosen topic, five questions and five true and five false statements were manually curated. In curating the questions and corresponding statements, we ensured diversity in the sub-topics covered.
We utilize ChatGPT API (gpt-3.5-turbo model) to generate text for each topic where the ChatGPT agent plays the role of an expert advocate on each topic. We generate 20 questions and use each question as a prompt to generate 20 true and 20 false statements. Therefore, we generate 800 statements per topic, and overall, we generate 4,800 statements for evaluation on automated statements which we refer to as "Auto True/False Statements".
If you are utilizing statements/dataset, please cite our work as follows.
Bahrami, Mehdi, and Ramya Srinivasan. "Examining LLM's Awareness of the United Nations Sustainable Development Goals (SDGs)." ICLR 2023 Workshop on Trustworthy and Reliable Large-Scale Machine Learning Models. URL:https://openreview.net/pdf?id=0oubWlDUIa
@inproceedings{bahrami2023examining,
title={Examining LLM's Awareness of the United Nations Sustainable Development Goals (SDGs)},
author={Bahrami, Mehdi and Srinivasan, Ramya},
booktitle={ICLR 2023 Workshop on Trustworthy and Reliable Large-Scale Machine Learning Models}
}