%0 Conference Proceedings %T Proceedings of the 3rd Workshop on Trustworthy Natural Language Processing (TrustNLP 2023) %E Ovalle, Anaelia %E Chang, Kai-Wei %E Mehrabi, Ninareh %E Pruksachatkun, Yada %E Galystan, Aram %E Dhamala, Jwala %E Verma, Apurv %E Cao, Trista %E Kumar, Anoop %E Gupta, Rahul %D 2023 %8 July %I Association for Computational Linguistics %C Toronto, Canada %F trustnlp-2023-trustworthy %U https://aclanthology.org/2023.trustnlp-1.0 %0 Conference Proceedings %T Towards Faithful Explanations for Text Classification with Robustness Improvement and Explanation Guided Training %A Li, Dongfang %A Hu, Baotian %A Chen, Qingcai %A He, Shan %Y Ovalle, Anaelia %Y Chang, Kai-Wei %Y Mehrabi, Ninareh %Y Pruksachatkun, Yada %Y Galystan, Aram %Y Dhamala, Jwala %Y Verma, Apurv %Y Cao, Trista %Y Kumar, Anoop %Y Gupta, Rahul %S Proceedings of the 3rd Workshop on Trustworthy Natural Language Processing (TrustNLP 2023) %D 2023 %8 July %I Association for Computational Linguistics %C Toronto, Canada %F li-etal-2023-towards %R 10.18653/v1/2023.trustnlp-1.1 %U https://aclanthology.org/2023.trustnlp-1.1 %U https://doi.org/10.18653/v1/2023.trustnlp-1.1 %P 1-14 %0 Conference Proceedings %T Driving Context into Text-to-Text Privatization %A Arnold, Stefan %A Yesilbas, Dilara %A Weinzierl, Sven %Y Ovalle, Anaelia %Y Chang, Kai-Wei %Y Mehrabi, Ninareh %Y Pruksachatkun, Yada %Y Galystan, Aram %Y Dhamala, Jwala %Y Verma, Apurv %Y Cao, Trista %Y Kumar, Anoop %Y Gupta, Rahul %S Proceedings of the 3rd Workshop on Trustworthy Natural Language Processing (TrustNLP 2023) %D 2023 %8 July %I Association for Computational Linguistics %C Toronto, Canada %F arnold-etal-2023-driving %R 10.18653/v1/2023.trustnlp-1.2 %U https://aclanthology.org/2023.trustnlp-1.2 %U https://doi.org/10.18653/v1/2023.trustnlp-1.2 %P 15-25 %0 Conference Proceedings %T Automated Ableism: An Exploration of Explicit Disability Biases in Sentiment and Toxicity Analysis Models %A Narayanan Venkit, Pranav %A Srinath, Mukund %A Wilson, Shomir %Y Ovalle, Anaelia %Y Chang, Kai-Wei %Y Mehrabi, Ninareh %Y Pruksachatkun, Yada %Y Galystan, Aram %Y Dhamala, Jwala %Y Verma, Apurv %Y Cao, Trista %Y Kumar, Anoop %Y Gupta, Rahul %S Proceedings of the 3rd Workshop on Trustworthy Natural Language Processing (TrustNLP 2023) %D 2023 %8 July %I Association for Computational Linguistics %C Toronto, Canada %F narayanan-venkit-etal-2023-automated %R 10.18653/v1/2023.trustnlp-1.3 %U https://aclanthology.org/2023.trustnlp-1.3 %U https://doi.org/10.18653/v1/2023.trustnlp-1.3 %P 26-34 %0 Conference Proceedings %T Pay Attention to the Robustness of Chinese Minority Language Models! Syllable-level Textual Adversarial Attack on Tibetan Script %A Cao, Xi %A Dawa, Dolma %A Qun, Nuo %A Nyima, Trashi %Y Ovalle, Anaelia %Y Chang, Kai-Wei %Y Mehrabi, Ninareh %Y Pruksachatkun, Yada %Y Galystan, Aram %Y Dhamala, Jwala %Y Verma, Apurv %Y Cao, Trista %Y Kumar, Anoop %Y Gupta, Rahul %S Proceedings of the 3rd Workshop on Trustworthy Natural Language Processing (TrustNLP 2023) %D 2023 %8 July %I Association for Computational Linguistics %C Toronto, Canada %F cao-etal-2023-pay-attention %R 10.18653/v1/2023.trustnlp-1.4 %U https://aclanthology.org/2023.trustnlp-1.4 %U https://doi.org/10.18653/v1/2023.trustnlp-1.4 %P 35-46 %0 Conference Proceedings %T Can we trust the evaluation on ChatGPT? %A Aiyappa, Rachith %A An, Jisun %A Kwak, Haewoon %A Ahn, Yong-yeol %Y Ovalle, Anaelia %Y Chang, Kai-Wei %Y Mehrabi, Ninareh %Y Pruksachatkun, Yada %Y Galystan, Aram %Y Dhamala, Jwala %Y Verma, Apurv %Y Cao, Trista %Y Kumar, Anoop %Y Gupta, Rahul %S Proceedings of the 3rd Workshop on Trustworthy Natural Language Processing (TrustNLP 2023) %D 2023 %8 July %I Association for Computational Linguistics %C Toronto, Canada %F aiyappa-etal-2023-trust %R 10.18653/v1/2023.trustnlp-1.5 %U https://aclanthology.org/2023.trustnlp-1.5 %U https://doi.org/10.18653/v1/2023.trustnlp-1.5 %P 47-54 %0 Conference Proceedings %T Improving Factuality of Abstractive Summarization via Contrastive Reward Learning %A Chern, I-chun %A Wang, Zhiruo %A Das, Sanjan %A Sharma, Bhavuk %A Liu, Pengfei %A Neubig, Graham %Y Ovalle, Anaelia %Y Chang, Kai-Wei %Y Mehrabi, Ninareh %Y Pruksachatkun, Yada %Y Galystan, Aram %Y Dhamala, Jwala %Y Verma, Apurv %Y Cao, Trista %Y Kumar, Anoop %Y Gupta, Rahul %S Proceedings of the 3rd Workshop on Trustworthy Natural Language Processing (TrustNLP 2023) %D 2023 %8 July %I Association for Computational Linguistics %C Toronto, Canada %F chern-etal-2023-improving %R 10.18653/v1/2023.trustnlp-1.6 %U https://aclanthology.org/2023.trustnlp-1.6 %U https://doi.org/10.18653/v1/2023.trustnlp-1.6 %P 55-60 %0 Conference Proceedings %T Examining the Causal Impact of First Names on Language Models: The Case of Social Commonsense Reasoning %A Jeoung, Sullam %A Diesner, Jana %A Kilicoglu, Halil %Y Ovalle, Anaelia %Y Chang, Kai-Wei %Y Mehrabi, Ninareh %Y Pruksachatkun, Yada %Y Galystan, Aram %Y Dhamala, Jwala %Y Verma, Apurv %Y Cao, Trista %Y Kumar, Anoop %Y Gupta, Rahul %S Proceedings of the 3rd Workshop on Trustworthy Natural Language Processing (TrustNLP 2023) %D 2023 %8 July %I Association for Computational Linguistics %C Toronto, Canada %F jeoung-etal-2023-examining %R 10.18653/v1/2023.trustnlp-1.7 %U https://aclanthology.org/2023.trustnlp-1.7 %U https://doi.org/10.18653/v1/2023.trustnlp-1.7 %P 61-72 %0 Conference Proceedings %T Reliability Check: An Analysis of GPT-3’s Response to Sensitive Topics and Prompt Wording %A Khatun, Aisha %A Brown, Daniel %Y Ovalle, Anaelia %Y Chang, Kai-Wei %Y Mehrabi, Ninareh %Y Pruksachatkun, Yada %Y Galystan, Aram %Y Dhamala, Jwala %Y Verma, Apurv %Y Cao, Trista %Y Kumar, Anoop %Y Gupta, Rahul %S Proceedings of the 3rd Workshop on Trustworthy Natural Language Processing (TrustNLP 2023) %D 2023 %8 July %I Association for Computational Linguistics %C Toronto, Canada %F khatun-brown-2023-reliability %R 10.18653/v1/2023.trustnlp-1.8 %U https://aclanthology.org/2023.trustnlp-1.8 %U https://doi.org/10.18653/v1/2023.trustnlp-1.8 %P 73-95 %0 Conference Proceedings %T Sample Attackability in Natural Language Adversarial Attacks %A Raina, Vyas %A Gales, Mark %Y Ovalle, Anaelia %Y Chang, Kai-Wei %Y Mehrabi, Ninareh %Y Pruksachatkun, Yada %Y Galystan, Aram %Y Dhamala, Jwala %Y Verma, Apurv %Y Cao, Trista %Y Kumar, Anoop %Y Gupta, Rahul %S Proceedings of the 3rd Workshop on Trustworthy Natural Language Processing (TrustNLP 2023) %D 2023 %8 July %I Association for Computational Linguistics %C Toronto, Canada %F raina-gales-2023-sample %R 10.18653/v1/2023.trustnlp-1.9 %U https://aclanthology.org/2023.trustnlp-1.9 %U https://doi.org/10.18653/v1/2023.trustnlp-1.9 %P 96-107 %0 Conference Proceedings %T A Keyword Based Approach to Understanding the Overpenalization of Marginalized Groups by English Marginal Abuse Models on Twitter %A Yee, Kyra %A Schoenauer Sebag, Alice %A Redfield, Olivia %A Eck, Matthias %A Sheng, Emily %A Belli, Luca %Y Ovalle, Anaelia %Y Chang, Kai-Wei %Y Mehrabi, Ninareh %Y Pruksachatkun, Yada %Y Galystan, Aram %Y Dhamala, Jwala %Y Verma, Apurv %Y Cao, Trista %Y Kumar, Anoop %Y Gupta, Rahul %S Proceedings of the 3rd Workshop on Trustworthy Natural Language Processing (TrustNLP 2023) %D 2023 %8 July %I Association for Computational Linguistics %C Toronto, Canada %F yee-etal-2023-keyword %R 10.18653/v1/2023.trustnlp-1.10 %U https://aclanthology.org/2023.trustnlp-1.10 %U https://doi.org/10.18653/v1/2023.trustnlp-1.10 %P 108-120 %0 Conference Proceedings %T An Empirical Study of Metrics to Measure Representational Harms in Pre-Trained Language Models %A Hosseini, Saghar %A Palangi, Hamid %A Awadallah, Ahmed Hassan %Y Ovalle, Anaelia %Y Chang, Kai-Wei %Y Mehrabi, Ninareh %Y Pruksachatkun, Yada %Y Galystan, Aram %Y Dhamala, Jwala %Y Verma, Apurv %Y Cao, Trista %Y Kumar, Anoop %Y Gupta, Rahul %S Proceedings of the 3rd Workshop on Trustworthy Natural Language Processing (TrustNLP 2023) %D 2023 %8 July %I Association for Computational Linguistics %C Toronto, Canada %F hosseini-etal-2023-empirical %R 10.18653/v1/2023.trustnlp-1.11 %U https://aclanthology.org/2023.trustnlp-1.11 %U https://doi.org/10.18653/v1/2023.trustnlp-1.11 %P 121-134 %0 Conference Proceedings %T Linguistic Properties of Truthful Response %A Lee, Bruce W. %A Arockiaraj, Benedict Florance %A Jin, Helen %Y Ovalle, Anaelia %Y Chang, Kai-Wei %Y Mehrabi, Ninareh %Y Pruksachatkun, Yada %Y Galystan, Aram %Y Dhamala, Jwala %Y Verma, Apurv %Y Cao, Trista %Y Kumar, Anoop %Y Gupta, Rahul %S Proceedings of the 3rd Workshop on Trustworthy Natural Language Processing (TrustNLP 2023) %D 2023 %8 July %I Association for Computational Linguistics %C Toronto, Canada %F lee-etal-2023-linguistic %R 10.18653/v1/2023.trustnlp-1.12 %U https://aclanthology.org/2023.trustnlp-1.12 %U https://doi.org/10.18653/v1/2023.trustnlp-1.12 %P 135-140 %0 Conference Proceedings %T Debunking Biases in Attention %A Chen, Shijing %A Naseem, Usman %A Razzak, Imran %Y Ovalle, Anaelia %Y Chang, Kai-Wei %Y Mehrabi, Ninareh %Y Pruksachatkun, Yada %Y Galystan, Aram %Y Dhamala, Jwala %Y Verma, Apurv %Y Cao, Trista %Y Kumar, Anoop %Y Gupta, Rahul %S Proceedings of the 3rd Workshop on Trustworthy Natural Language Processing (TrustNLP 2023) %D 2023 %8 July %I Association for Computational Linguistics %C Toronto, Canada %F chen-etal-2023-debunking %R 10.18653/v1/2023.trustnlp-1.13 %U https://aclanthology.org/2023.trustnlp-1.13 %U https://doi.org/10.18653/v1/2023.trustnlp-1.13 %P 141-150 %0 Conference Proceedings %T Guiding Text-to-Text Privatization by Syntax %A Arnold, Stefan %A Yesilbas, Dilara %A Weinzierl, Sven %Y Ovalle, Anaelia %Y Chang, Kai-Wei %Y Mehrabi, Ninareh %Y Pruksachatkun, Yada %Y Galystan, Aram %Y Dhamala, Jwala %Y Verma, Apurv %Y Cao, Trista %Y Kumar, Anoop %Y Gupta, Rahul %S Proceedings of the 3rd Workshop on Trustworthy Natural Language Processing (TrustNLP 2023) %D 2023 %8 July %I Association for Computational Linguistics %C Toronto, Canada %F arnold-etal-2023-guiding %R 10.18653/v1/2023.trustnlp-1.14 %U https://aclanthology.org/2023.trustnlp-1.14 %U https://doi.org/10.18653/v1/2023.trustnlp-1.14 %P 151-162 %0 Conference Proceedings %T Are fairness metric scores enough to assess discrimination biases in machine learning? %A Jourdan, Fanny %A Risser, Laurent %A Loubes, Jean-michel %A Asher, Nicholas %Y Ovalle, Anaelia %Y Chang, Kai-Wei %Y Mehrabi, Ninareh %Y Pruksachatkun, Yada %Y Galystan, Aram %Y Dhamala, Jwala %Y Verma, Apurv %Y Cao, Trista %Y Kumar, Anoop %Y Gupta, Rahul %S Proceedings of the 3rd Workshop on Trustworthy Natural Language Processing (TrustNLP 2023) %D 2023 %8 July %I Association for Computational Linguistics %C Toronto, Canada %F jourdan-etal-2023-fairness %R 10.18653/v1/2023.trustnlp-1.15 %U https://aclanthology.org/2023.trustnlp-1.15 %U https://doi.org/10.18653/v1/2023.trustnlp-1.15 %P 163-174 %0 Conference Proceedings %T DEPTH+: An Enhanced Depth Metric for Wikipedia Corpora Quality %A Alshahrani, Saied %A Alshahrani, Norah %A Matthews, Jeanna %Y Ovalle, Anaelia %Y Chang, Kai-Wei %Y Mehrabi, Ninareh %Y Pruksachatkun, Yada %Y Galystan, Aram %Y Dhamala, Jwala %Y Verma, Apurv %Y Cao, Trista %Y Kumar, Anoop %Y Gupta, Rahul %S Proceedings of the 3rd Workshop on Trustworthy Natural Language Processing (TrustNLP 2023) %D 2023 %8 July %I Association for Computational Linguistics %C Toronto, Canada %F alshahrani-etal-2023-depth %R 10.18653/v1/2023.trustnlp-1.16 %U https://aclanthology.org/2023.trustnlp-1.16 %U https://doi.org/10.18653/v1/2023.trustnlp-1.16 %P 175-189 %0 Conference Proceedings %T Distinguishing Fact from Fiction: A Benchmark Dataset for Identifying Machine-Generated Scientific Papers in the LLM Era. %A Mosca, Edoardo %A Abdalla, Mohamed Hesham Ibrahim %A Basso, Paolo %A Musumeci, Margherita %A Groh, Georg %Y Ovalle, Anaelia %Y Chang, Kai-Wei %Y Mehrabi, Ninareh %Y Pruksachatkun, Yada %Y Galystan, Aram %Y Dhamala, Jwala %Y Verma, Apurv %Y Cao, Trista %Y Kumar, Anoop %Y Gupta, Rahul %S Proceedings of the 3rd Workshop on Trustworthy Natural Language Processing (TrustNLP 2023) %D 2023 %8 July %I Association for Computational Linguistics %C Toronto, Canada %F mosca-etal-2023-distinguishing %R 10.18653/v1/2023.trustnlp-1.17 %U https://aclanthology.org/2023.trustnlp-1.17 %U https://doi.org/10.18653/v1/2023.trustnlp-1.17 %P 190-207 %0 Conference Proceedings %T Detecting Personal Information in Training Corpora: an Analysis %A Subramani, Nishant %A Luccioni, Sasha %A Dodge, Jesse %A Mitchell, Margaret %Y Ovalle, Anaelia %Y Chang, Kai-Wei %Y Mehrabi, Ninareh %Y Pruksachatkun, Yada %Y Galystan, Aram %Y Dhamala, Jwala %Y Verma, Apurv %Y Cao, Trista %Y Kumar, Anoop %Y Gupta, Rahul %S Proceedings of the 3rd Workshop on Trustworthy Natural Language Processing (TrustNLP 2023) %D 2023 %8 July %I Association for Computational Linguistics %C Toronto, Canada %F subramani-etal-2023-detecting %R 10.18653/v1/2023.trustnlp-1.18 %U https://aclanthology.org/2023.trustnlp-1.18 %U https://doi.org/10.18653/v1/2023.trustnlp-1.18 %P 208-220 %0 Conference Proceedings %T Enhancing textual counterfactual explanation intelligibility through Counterfactual Feature Importance %A Bhan, Milan %A Vittaut, Jean-noel %A Chesneau, Nicolas %A Lesot, Marie-jeanne %Y Ovalle, Anaelia %Y Chang, Kai-Wei %Y Mehrabi, Ninareh %Y Pruksachatkun, Yada %Y Galystan, Aram %Y Dhamala, Jwala %Y Verma, Apurv %Y Cao, Trista %Y Kumar, Anoop %Y Gupta, Rahul %S Proceedings of the 3rd Workshop on Trustworthy Natural Language Processing (TrustNLP 2023) %D 2023 %8 July %I Association for Computational Linguistics %C Toronto, Canada %F bhan-etal-2023-enhancing %R 10.18653/v1/2023.trustnlp-1.19 %U https://aclanthology.org/2023.trustnlp-1.19 %U https://doi.org/10.18653/v1/2023.trustnlp-1.19 %P 221-231 %0 Conference Proceedings %T Privacy- and Utility-Preserving NLP with Anonymized data: A case study of Pseudonymization %A Yermilov, Oleksandr %A Raheja, Vipul %A Chernodub, Artem %Y Ovalle, Anaelia %Y Chang, Kai-Wei %Y Mehrabi, Ninareh %Y Pruksachatkun, Yada %Y Galystan, Aram %Y Dhamala, Jwala %Y Verma, Apurv %Y Cao, Trista %Y Kumar, Anoop %Y Gupta, Rahul %S Proceedings of the 3rd Workshop on Trustworthy Natural Language Processing (TrustNLP 2023) %D 2023 %8 July %I Association for Computational Linguistics %C Toronto, Canada %F yermilov-etal-2023-privacy %R 10.18653/v1/2023.trustnlp-1.20 %U https://aclanthology.org/2023.trustnlp-1.20 %U https://doi.org/10.18653/v1/2023.trustnlp-1.20 %P 232-241 %0 Conference Proceedings %T GPTs Don’t Keep Secrets: Searching for Backdoor Watermark Triggers in Autoregressive Language Models %A Lucas, Evan %A Havens, Timothy %Y Ovalle, Anaelia %Y Chang, Kai-Wei %Y Mehrabi, Ninareh %Y Pruksachatkun, Yada %Y Galystan, Aram %Y Dhamala, Jwala %Y Verma, Apurv %Y Cao, Trista %Y Kumar, Anoop %Y Gupta, Rahul %S Proceedings of the 3rd Workshop on Trustworthy Natural Language Processing (TrustNLP 2023) %D 2023 %8 July %I Association for Computational Linguistics %C Toronto, Canada %F lucas-havens-2023-gpts %R 10.18653/v1/2023.trustnlp-1.21 %U https://aclanthology.org/2023.trustnlp-1.21 %U https://doi.org/10.18653/v1/2023.trustnlp-1.21 %P 242-248 %0 Conference Proceedings %T Make Text Unlearnable: Exploiting Effective Patterns to Protect Personal Data %A Li, Xinzhe %A Liu, Ming %Y Ovalle, Anaelia %Y Chang, Kai-Wei %Y Mehrabi, Ninareh %Y Pruksachatkun, Yada %Y Galystan, Aram %Y Dhamala, Jwala %Y Verma, Apurv %Y Cao, Trista %Y Kumar, Anoop %Y Gupta, Rahul %S Proceedings of the 3rd Workshop on Trustworthy Natural Language Processing (TrustNLP 2023) %D 2023 %8 July %I Association for Computational Linguistics %C Toronto, Canada %F li-liu-2023-make %R 10.18653/v1/2023.trustnlp-1.22 %U https://aclanthology.org/2023.trustnlp-1.22 %U https://doi.org/10.18653/v1/2023.trustnlp-1.22 %P 249-259 %0 Conference Proceedings %T Training Data Extraction From Pre-trained Language Models: A Survey %A Ishihara, Shotaro %Y Ovalle, Anaelia %Y Chang, Kai-Wei %Y Mehrabi, Ninareh %Y Pruksachatkun, Yada %Y Galystan, Aram %Y Dhamala, Jwala %Y Verma, Apurv %Y Cao, Trista %Y Kumar, Anoop %Y Gupta, Rahul %S Proceedings of the 3rd Workshop on Trustworthy Natural Language Processing (TrustNLP 2023) %D 2023 %8 July %I Association for Computational Linguistics %C Toronto, Canada %F ishihara-2023-training %R 10.18653/v1/2023.trustnlp-1.23 %U https://aclanthology.org/2023.trustnlp-1.23 %U https://doi.org/10.18653/v1/2023.trustnlp-1.23 %P 260-275 %0 Conference Proceedings %T Expanding Scope: Adapting English Adversarial Attacks to Chinese %A Liu, Hanyu %A Cai, Chengyuan %A Qi, Yanjun %Y Ovalle, Anaelia %Y Chang, Kai-Wei %Y Mehrabi, Ninareh %Y Pruksachatkun, Yada %Y Galystan, Aram %Y Dhamala, Jwala %Y Verma, Apurv %Y Cao, Trista %Y Kumar, Anoop %Y Gupta, Rahul %S Proceedings of the 3rd Workshop on Trustworthy Natural Language Processing (TrustNLP 2023) %D 2023 %8 July %I Association for Computational Linguistics %C Toronto, Canada %F liu-etal-2023-expanding %R 10.18653/v1/2023.trustnlp-1.24 %U https://aclanthology.org/2023.trustnlp-1.24 %U https://doi.org/10.18653/v1/2023.trustnlp-1.24 %P 276-286 %0 Conference Proceedings %T IMBERT: Making BERT Immune to Insertion-based Backdoor Attacks %A He, Xuanli %A Wang, Jun %A Rubinstein, Benjamin %A Cohn, Trevor %Y Ovalle, Anaelia %Y Chang, Kai-Wei %Y Mehrabi, Ninareh %Y Pruksachatkun, Yada %Y Galystan, Aram %Y Dhamala, Jwala %Y Verma, Apurv %Y Cao, Trista %Y Kumar, Anoop %Y Gupta, Rahul %S Proceedings of the 3rd Workshop on Trustworthy Natural Language Processing (TrustNLP 2023) %D 2023 %8 July %I Association for Computational Linguistics %C Toronto, Canada %F he-etal-2023-imbert %R 10.18653/v1/2023.trustnlp-1.25 %U https://aclanthology.org/2023.trustnlp-1.25 %U https://doi.org/10.18653/v1/2023.trustnlp-1.25 %P 287-301 %0 Conference Proceedings %T On The Real-world Performance of Machine Translation: Exploring Social Media Post-authors’ Perspectives %A Gupta, Ananya %A Takeuchi, Jae %A Knijnenburg, Bart %Y Ovalle, Anaelia %Y Chang, Kai-Wei %Y Mehrabi, Ninareh %Y Pruksachatkun, Yada %Y Galystan, Aram %Y Dhamala, Jwala %Y Verma, Apurv %Y Cao, Trista %Y Kumar, Anoop %Y Gupta, Rahul %S Proceedings of the 3rd Workshop on Trustworthy Natural Language Processing (TrustNLP 2023) %D 2023 %8 July %I Association for Computational Linguistics %C Toronto, Canada %F gupta-etal-2023-real %R 10.18653/v1/2023.trustnlp-1.26 %U https://aclanthology.org/2023.trustnlp-1.26 %U https://doi.org/10.18653/v1/2023.trustnlp-1.26 %P 302-310 %0 Conference Proceedings %T Enabling Classifiers to Make Judgements Explicitly Aligned with Human Values %A Bang, Yejin %A Yu, Tiezheng %A Madotto, Andrea %A Lin, Zhaojiang %A Diab, Mona %A Fung, Pascale %Y Ovalle, Anaelia %Y Chang, Kai-Wei %Y Mehrabi, Ninareh %Y Pruksachatkun, Yada %Y Galystan, Aram %Y Dhamala, Jwala %Y Verma, Apurv %Y Cao, Trista %Y Kumar, Anoop %Y Gupta, Rahul %S Proceedings of the 3rd Workshop on Trustworthy Natural Language Processing (TrustNLP 2023) %D 2023 %8 July %I Association for Computational Linguistics %C Toronto, Canada %F bang-etal-2023-enabling %R 10.18653/v1/2023.trustnlp-1.27 %U https://aclanthology.org/2023.trustnlp-1.27 %U https://doi.org/10.18653/v1/2023.trustnlp-1.27 %P 311-325 %0 Conference Proceedings %T Strength in Numbers: Estimating Confidence of Large Language Models by Prompt Agreement %A Portillo Wightman, Gwenyth %A Delucia, Alexandra %A Dredze, Mark %Y Ovalle, Anaelia %Y Chang, Kai-Wei %Y Mehrabi, Ninareh %Y Pruksachatkun, Yada %Y Galystan, Aram %Y Dhamala, Jwala %Y Verma, Apurv %Y Cao, Trista %Y Kumar, Anoop %Y Gupta, Rahul %S Proceedings of the 3rd Workshop on Trustworthy Natural Language Processing (TrustNLP 2023) %D 2023 %8 July %I Association for Computational Linguistics %C Toronto, Canada %F portillo-wightman-etal-2023-strength %R 10.18653/v1/2023.trustnlp-1.28 %U https://aclanthology.org/2023.trustnlp-1.28 %U https://doi.org/10.18653/v1/2023.trustnlp-1.28 %P 326-362