Assertion Detection in Clinical Natural Language Processing Using Large Language Models

Abstract:

In this study, we address the task of assertion detection when extracting medical concepts from clinical notes, a key process in clinical natural language processing (NLP). Assertion detection in clinical NLP usually involves identifying assertion types for medical concepts in the clinical text, namely certainty (whether the medical concept is positive, negated, possible, or hypothetical), temporality (whether the medical concept refers to the present or to past history), and experiencer (whether the medical concept is described for the patient or a family member). These assertion types are essential for healthcare professionals to quickly and clearly understand the context of medical conditions from unstructured clinical texts, directly influencing the quality and outcomes of patient care. Although widely used, traditional methods, particularly rule-based NLP systems and machine learning or deep learning models, demand intensive manual effort to create patterns and tend to overlook less common assertion types, leading to an incomplete understanding of the context. To address this challenge, our research introduces a novel methodology that utilizes Large Language Models (LLMs) pre-trained on a vast array of medical data for assertion detection. We enhanced the current method with advanced reasoning techniques, including Tree of Thought (ToT), Chain of Thought (CoT), and Self-Consistency (SC), and refined it further with Low-Rank Adaptation (LoRA) fine-tuning. We first evaluated the model on the i2b2 2010 assertion dataset. Our method achieved a micro-averaged F-1 of 0.89, an improvement of 0.11 over previous work. To further assess the generalizability of our approach, we extended our evaluation to a local dataset that focused on sleep concept extraction. Our approach achieved an F-1 of 0.74, which is 0.31 higher than the previous method.
The results show that using LLMs is a viable option for assertion detection in clinical NLP and can potentially integrate w...

Published in: 2024 IEEE 12th International Conference on Healthcare Informatics (ICHI)

Date of Conference: 03-06 June 2024

Date Added to IEEE Xplore: 22 August 2024

DOI: 10.1109/ICHI61247.2024.00039

Conference Location: Orlando, FL, USA

I. Introduction

Assertion detection is a key task within clinical natural language processing (NLP) [1]. It usually involves identifying the assertion types for medical concepts in the clinical text, namely certainty (whether the medical concept is positive, negated, possible, or hypothetical), temporality (whether the medical concept refers to the present or to past history), and experiencer (whether the medical concept is described for the patient or a family member). Figure 1 shows an example of medical concepts and the corresponding assertions. This task plays a crucial role in understanding medical concepts from free-text electronic health records (EHRs), directly impacting the accuracy of clinical decision-making and the efficiency of patient care. As a core component of clinical NLP, assertion detection also holds significant potential for enhancing information retrieval and automated clinical reasoning.

However, it faces challenges such as class distribution imbalance and the unstructured nature of clinical notes. Particularly challenging is the classification of assertions like ‘Possible’ and ‘Family’, which occur less frequently and are often expressed ambiguously.

Previous studies have widely applied rule-based methods such as NegEx [2] and ConText [1] in clinical NLP software, setting a benchmark in medical informatics with applications in tools like OHNLP Toolkit [3], MedTagger [4], medspaCy [5], and cTAKES [6]. However, these rule-based approaches are limited by their fixed patterns, which cannot enumerate every way an assertion may be expressed, and this often leads to low recall. To overcome these limitations, deep learning methods such as convolutional neural networks (CNNs) and long short-term memory (LSTM) networks [7]–[9] were introduced. Although these approaches show promise, they still require substantial amounts of labeled data and tend to underperform when dealing with small or imbalanced datasets.

Fig. 1. Examples of assertions in clinical texts. Medical concepts and the corresponding assertions are highlighted.
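
To make the recall limitation of fixed-pattern methods concrete, the following is a minimal Python sketch of trigger-based assertion detection in the general spirit of rule-based systems. It is not the NegEx or ConText implementation: the trigger lists, the `WINDOW` size, the `Assertion` class, and the `detect_assertion` function are all hypothetical and heavily simplified, and the 'hypothetical' certainty label is not handled at all.

```python
import re
from dataclasses import dataclass

# Hypothetical, heavily abbreviated trigger lists chosen for illustration only;
# real rule-based lexicons are far larger and also handle scope termination.
NEGATION_TRIGGERS = ["no", "denies", "without", "negative for"]
POSSIBLE_TRIGGERS = ["possible", "probable", "cannot rule out"]
HISTORICAL_TRIGGERS = ["history of", "previous"]
FAMILY_TRIGGERS = ["mother", "father", "brother", "sister", "family history"]

WINDOW = 5  # assumed number of tokens scanned before the concept


@dataclass
class Assertion:
    certainty: str = "positive"
    temporality: str = "present"
    experiencer: str = "patient"


def _has_trigger(context: str, triggers: list[str]) -> bool:
    """Return True if any trigger phrase occurs as whole words in the context."""
    return any(re.search(r"\b" + re.escape(t) + r"\b", context) for t in triggers)


def detect_assertion(sentence: str, concept: str) -> Assertion:
    """Label a concept using only the few tokens that precede it."""
    lowered = sentence.lower()
    idx = lowered.find(concept.lower())
    if idx < 0:
        raise ValueError(f"concept {concept!r} not found in sentence")
    # Only text before the concept is inspected, so post-concept cues are missed.
    context = " ".join(lowered[:idx].split()[-WINDOW:])

    result = Assertion()
    if _has_trigger(context, NEGATION_TRIGGERS):
        result.certainty = "negated"
    elif _has_trigger(context, POSSIBLE_TRIGGERS):
        result.certainty = "possible"
    if _has_trigger(context, HISTORICAL_TRIGGERS):
        result.temporality = "past"
    if _has_trigger(context, FAMILY_TRIGGERS):
        result.experiencer = "family"
    return result


if __name__ == "__main__":
    print(detect_assertion("The patient denies chest pain.", "chest pain"))
    print(detect_assertion("Mother had a history of diabetes.", "diabetes"))
    # A cue that is not in the lists and follows the concept is silently missed:
    print(detect_assertion("Pneumonia is unlikely.", "pneumonia"))
```

In this sketch the third call falls back to the default labels (positive, present, patient), because "unlikely" is absent from the trigger lists and appears after the concept. Every such phrasing would need its own hand-written pattern, which is the kind of gap the low-recall observation above refers to.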
