Pusan National University

Category: February 2025

Posted: 2024.10.09

Modified: 2024.12.28

Author: 양홍진

Enhancing Nested Entity Recognition Using Nested Rule-Based Method on Neural Network

In Natural Language Processing, Named Entity Recognition (NER) is commonly divided into three types: flat NER, nested NER, and discontinuous NER. While flat NER has received significant attention from researchers, nested NER remains a major challenge. Through an analysis of large datasets containing nested entities, this paper identifies distinct patterns in how entities nest. Building on these patterns, it proposes a set of nesting rules and explores their application to the nested NER task. Current approaches to nested NER include sequence labeling with merged label layers, cascade models, and models based on reading comprehension. Among these, sequence labeling with merged label layers is favored for its simplicity and ease of implementation. ChatGPT, a widely used model, can also address nested NER tasks; the paper therefore applies the proposed nesting rules to both the sequence labeling model and ChatGPT.
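To make the idea of merged label layers concrete, here is a minimal Python sketch, not the thesis implementation. It assumes each nesting level is annotated with its own BIO tag sequence and that the layers are joined into one tag per token with a separator. The function name `merge_label_layers`, the `|` separator, and the example sentence are all hypothetical choices made for illustration.

```python
from typing import List


def merge_label_layers(layers: List[List[str]], sep: str = "|") -> List[str]:
    """Join per-token BIO tags from several nesting layers into one tag per token."""
    if not layers:
        return []
    length = len(layers[0])
    if any(len(layer) != length for layer in layers):
        raise ValueError("all layers must cover the same tokens")
    merged = []
    for token_tags in zip(*layers):
        # Keep only non-"O" tags; a token outside every entity stays "O".
        active = [tag for tag in token_tags if tag != "O"]
        merged.append(sep.join(active) if active else "O")
    return merged


# Hypothetical example: a protein mention nested inside a DNA mention.
tokens = ["IL-2", "gene", "expression"]
outer = ["B-DNA", "I-DNA", "O"]      # outer layer: "IL-2 gene"
inner = ["B-protein", "O", "O"]      # inner layer: "IL-2"
merged = merge_label_layers([outer, inner])
print(list(zip(tokens, merged)))
```

With merged tags, a single sequence labeler can be trained on one label per token, at the cost of a larger label set.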

First, the sequence labeling model with merged label layers is modified to better incorporate the nesting rules. The sequence labeling approach is restructured as a pipeline that splits the problem into a sequence labeling task and a text classification task. In this pipeline, specific entity categories are no longer labeled directly. Instead, entity types are unified into main and subcategories, which are embedded into the recognized text as identifiers for the text classification task. A BERT+BiLSTM+CRF model is used for sequence labeling and a BERT model for text classification. Experiments on three nested NER datasets (GENIA, CMeEE, and GermEval 2014) show that the improved models significantly outperform the original methods and are competitive with existing models, with F1 scores of 79.21, 66.71, and 87.81, respectively.
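The hand-off between the two pipeline stages might look roughly like the sketch below. It assumes the sequence labeler has already produced type-agnostic spans tagged only as main or sub, and that each span is wrapped in marker tokens before being passed to the text classifier. The `Span` class, the `<MAIN>`/`<SUB>` marker format, and the example sentence are hypothetical; the abstract does not specify how the identifiers are actually embedded.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Span:
    start: int   # index of the first token in the span
    end: int     # index one past the last token
    level: str   # "MAIN" or "SUB" (hypothetical identifier names)


def build_classifier_input(tokens: List[str], span: Span) -> str:
    """Wrap one recognized span in level markers to form a classifier input."""
    if not (0 <= span.start < span.end <= len(tokens)):
        raise ValueError("span out of range")
    marked = (
        tokens[:span.start]
        + [f"<{span.level}>"]
        + tokens[span.start:span.end]
        + [f"</{span.level}>"]
        + tokens[span.end:]
    )
    return " ".join(marked)


# Hypothetical output of the sequence labeling stage for one sentence.
tokens = ["IL-2", "gene", "expression"]
spans = [Span(0, 2, "MAIN"), Span(0, 1, "SUB")]
inputs = [build_classifier_input(tokens, s) for s in spans]
for text in inputs:
    print(text)
```

Each marked string would then be classified separately, so the classifier assigns a concrete entity type to one span at a time, even when spans overlap.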

The paper also applies these rules to ChatGPT. The proposed nesting rules are provided as prompt instructions that guide ChatGPT in identifying and labeling entities in sentences. On the GENIA dataset, this yields a substantial improvement over the zero-shot and few-shot settings. By applying these nesting rules, the experiments not only improve the accuracy of nested entity recognition but also highlight the need for, and potential of, further research in this field.
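As a rough illustration of supplying rules as prompt instructions, the sketch below assembles a prompt from a list of rules and a sentence. The abstract does not state the rules or the prompt wording, so the rule strings here are explicit placeholders, and the function name `build_prompt` and the prompt template are assumptions.

```python
from typing import List


def build_prompt(rules: List[str], sentence: str) -> str:
    """Assemble a prompt that lists the nesting rules before the input sentence."""
    numbered = "\n".join(f"{i}. {rule}" for i, rule in enumerate(rules, start=1))
    return (
        "Identify all named entities in the sentence, including nested ones.\n"
        "Follow these nesting rules:\n"
        f"{numbered}\n"
        f"Sentence: {sentence}\n"
        "Entities:"
    )


# Placeholders only: the actual rules are defined in the thesis, not here.
placeholder_rules = ["<nesting rule 1>", "<nesting rule 2>"]
prompt = build_prompt(placeholder_rules, "IL-2 gene expression requires NF-kappa B .")
print(prompt)
```

The resulting string would be sent to the chat model as the user message; the zero-shot baseline corresponds to omitting the rules section.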

Degree date: February 2025

Advisor: 권혁철

Keywords: Nested entity; NER; Sequence labeling; Text classification; Merged label; BERT model

Project web page: https://sites.google.com/view/a-nested-rule-based-approach
