FEW-SHOT BIOMEDICAL NER EMPOWERED BY LLMS-ASSISTED DATA AUGMENTATION AND MULTI-SCALE FEATURE EXTRACTION

Few-shot biomedical NER empowered by LLMs-assisted data augmentation and multi-scale feature extraction

Few-shot biomedical NER empowered by LLMs-assisted data augmentation and multi-scale feature extraction

Blog Article

Abstract Named Entity Recognition (NER) is a fundamental task in processing biomedical text.Due to the limited availability of labeled data, researchers have investigated few-shot learning methods to tackle this challenge.However, replicating the performance of fully supervised methods remains difficult in few-shot scenarios.This paper addresses two main issues.In terms of data augmentation, existing methods primarily focus on replacing content in the original ac30s1 text, which can potentially distort the semantics.

Furthermore, current approaches often neglect sentence features at multiple scales.To overcome these challenges, we utilize ChatGPT to generate enriched data with distinct semantics for the same entities, thereby reducing noisy data.Simultaneously, we employ dynamic convolution to capture multi-scale semantic information in sentences and enhance feature representation based on PubMedBERT.We evaluated the experiments on four biomedical NER datasets (BC5CDR-Disease, NCBI, BioNLP11EPI, BioNLP13GE), and the results party yard cup exceeded the current state-of-the-art models in most few-shot scenarios, including mainstream large language models like ChatGPT.The results confirm the effectiveness of the proposed method in data augmentation and model generalization.

Report this page