분류
2025년 2월
작성일
2024.10.09
수정일
2024.12.28
작성자
양홍진
조회수
41

Enhancing Nested Entity Recognition Using Nested Rule-Based Method on Neural Network

In the field of Natural Language Processing, Named Entity Recognition can be categorized into three main types: flat NER, nested NER, and discontinuous NER. While flat NER has garnered significant attention from researchers, nested NER remains a major challenge. By conducting a thorough analysis of large datasets containing nested entities, this paper identifies distinct patterns within these entities. Building on these patterns, the paper proposes a set of nesting rules and explores their application to the nested NER task. Currently, approaches for handling nested NER include sequence labeling with merged label layers, cascade models, and models based on reading comprehension. Among these, sequence labeling with merged label layers is favored for its simplicity and ease of implementation. Additionally, ChatGPT, a widely used model, is capable of addressing nested NER tasks; thus, the paper applies the proposed nesting rules to both the sequence labeling model and ChatGPT.

To begin, the sequence labeling model with merged label layers is optimized in this study to better incorporate the nesting rules. The experiments utilize a pipeline model to improve the sequence labeling approach, splitting the model into sequence labeling and text classification tasks. During this process, the practice of labeling specific entity categories is abandoned. Instead, entity types are unified into main and subcategories and embedded into the recognized text as identifiers for the text classification task. The model choices for the two tasks include a BERT+BiLSTM+CRF model for sequence labeling and a BERT model for text classification. Experiments on three nested NER datasets: GENIA, CMeEE, and GermEval 2014, demonstrate that the improved models significantly outperform the original methods and exhibit strong competitiveness against existing models. The F1 scores for these datasets were 79.21, 66.71, and 87.81, respectively.

Additionally, the paper attempts to apply these rules to ChatGPT. In ChatGPT, the nesting rules proposed in this study are provided as prompt instructions, which guide ChatGPT in identifying and labeling entities in sentences according to these rules. When testing the GENIA dataset with ChatGPT, the results show a substantial improvement in performance compared to zero-shot, and few-shot modes. Through the application of these nesting rules, the experiments not only improve the accuracy of nested entity recognition but also highlight the need for and potential of further research in this field.

 

학위연월
2025년2월
지도교수
권혁철
키워드
Nested entity; NER; Sequence labeling; Text classification; Merged label; BERT model
소개 웹페이지
https://sites.google.com/view/a-nested-rule-based-approach
첨부파일
첨부파일이(가) 없습니다.
다음글
A Low-cost Deep Learning Model for Real-time Surveillance Video Defogging and Low Light Enhancement
등 제강 2024-10-10 14:42:33.83
이전글
다양한 도메인과 데이터 형식에 강건한 사전학습 언어모델 기반의 표 질의응답 방법
조상현 2024-10-09 13:03:45.703
RSS 2.0 123
게시물 검색
박사학위논문
번호 제목 작성자 작성일 첨부파일 조회수
123 Uncertainty-Based Hybrid Deep Learning Approach fo 멘가라 악셀 기드온 2024.12.10 0 5
122 Effective Deep Learning Primitives Design for Bina 황선진 2024.10.14 0 30
121 Toward Immersive Multi-view Video Streaming 탄중 디온 2024.10.14 0 15
120 A Low-cost Deep Learning Model for Real-time Surve 등 제강 2024.10.10 0 39
119 Enhancing Nested Entity Recognition Using Nested R 양홍진 2024.10.09 0 41
118 다양한 도메인과 데이터 형식에 강건한 사전학습 언어모델 기반의 표 질의응답 방법 조상현 2024.10.09 0 35
117 Trust Guard Extension for Enhanced Security Featur 김해용 2024.05.04 0 61
116 Task-Specific Differential Private Data Publish Me 신진명 2024.04.09 0 67
115 Advanced Defense Framework against Physical Advers 김용수 2024.04.08 0 87
114 한글 메신저 채팅의 크로스 텍스팅 탐지를 위한 저자 검증 모형 이다영 2024.04.05 0 88
113 상태 기반 테스트 시나리오 보강 방법 이선열 2023.10.17 0 158
112 Manufacturing Testing Automation FrameworkBased on 강효은 2023.10.17 0 181
111 Synthesizing Robust Physical Camouflage for Univer 수랸토 나우팔 2023.10.16 0 171
110 복잡도 다양성을 고려한 C 프로그램의 시험 용이성 예측 모형 구축 방법 최현재 2023.10.16 0 146
109 Design and Optimization of Quantum Arithmetic Circ 라라사티 하라스타 타티마 2023.10.13 0 173
108 Improving 6TiSCH Network Formation and Transmissio 파와즈 자키 자키얄 2023.10.10 0 162
107 저지연 고신뢰 운전자 프로파일링을 위한 딥러닝 모델 및 조기 종료 기법 임재봉 2023.10.08 0 223
106 802.11ax 대규모 Wi-Fi 환경의 심층 생성 모델을 활용한 트래픽 모델링 및 AP 이재민 2023.04.07 0 137
105 뉴런 클러스터를 활용한 합성곱 신경망 이미지 분류 신뢰성 향상 방법 이영우 2023.04.06 0 128
104 Trust Guard Extension Framework for Enhanced Secur 김해용 2023.04.06 0 107