outstanding Jul. 2024 Length-aware Byte Pair Encoding for Mitigating Over-segmentation in Korean Machine Translation (ACL-findings 2024)
outstanding Jul. 2024 KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models (ACL-findings 2024)
outstanding May. 2024 Leveraging Pre-existing Resources for Data-Efficient Counter-Narrative Generation in Korean (LREC-COLING 2024)
outstanding May. 2024 KNOTICED: Augmentative and Alternative Communication Software for Language Developmental Disabilities (LREC-COLING 2024)
outstanding Mar. 2024 Hyper-BTS Dataset: Scalability and Enhanced Analysis of Back TranScription (BTS) for ASR Post-Processing (EACL-findings 2024)
outstanding Mar. 2024 Generative Interpretation: Toward Human-Like Evaluation for Educational Question-Answer Pair Generation (EACL-findings 2024)
outstanding Dec. 2023 Doubts on the reliability of parallel corpus filtering (Expert Systems with Applications 2023)
outstanding Dec. 2023 KEBAP: Korean Error Explainable Benchmark Dataset for ASR and Post-processing (EMNLP 2023)
outstanding Dec. 2023 CReTIHC: Designing Causal Reasoning Tasks about Temporal Interventions and Hallucinated Confoundings (EMNLP-findings 2023)
outstanding Dec. 2023 CHEF in the Language Kitchen: A Generative Data Augmentation Leveraging Korean Morpheme Ingredients (EMNLP 2023)
outstanding Nov. 2023 Informative Evidence-guided Prompt-based Fine-tuning for English-Korean Critical Error Detection (IJCNLP-AACL 2023)
outstanding Jul. 2023 PEEP-Talk: A Situational Dialogue-based Chatbot for English Education (ACL 2023)
outstanding Oct. 2022 PU-GEN: Enhancing generative commonsense reasoning for language models with human-centered knowledge (Knowledge-Based Systems 2022)
outstanding Oct. 2022 PicTalky: Augmentative and Alternative Communication Software for Language Developmental Disabilities (AACL 2022)
outstanding Oct. 2022 Plain Template Insertion: Korean-Prompt-Based Engineering for Few-Shot Learners (IEEE Access 2022)
outstanding Sep. 2022 QUAK: A Synthetic Quality Estimation Dataset for Korean-English Neural Machine Translation (COLING 2022)
outstanding Jul. 2022 A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation (Findings of NAACL 2022)
outstanding Jun. 2022 Empirical Analysis of Noising Scheme based Synthetic Data Generation for Automatic Post-editing (LREC 2022)
outstanding May. 2022 Dense-to-Question and Sparse-to-Answer: Hybrid Retriever System for Industrial Frequently Asked Questions (Mathematics 2022)
outstanding Oct. 2021 [Best Paper Awards] KommonGen: A Dataset for Korean Generative Commonsense Reasoning Evaluation (HCLT 2021)
outstanding Aug. 2021 BTS: Back TranScription for Speech-to-Text Post-Processor using Text-to-Speech-to-Text (WAT2021)