Automated chart review utilizing natural language processing algorithm for asthma predictive index

BMC Pulmonary Medicine 18:34, 2018

BMC series – open, inclusive and trusted

Research article – Open Access – Open Peer Review

https://doi.org/10.1186/s12890-018-0593-9

Harsheen Kaur†, Sunghwan Sohn†, Chung-Il Wi, Euijung Ryu, Miguel A. Park, Kay Bachman, Hirohito Kita, Ivana Croghan, Jose A. Castro-Rodriguez, Gretchen A. Voge, Hongfang Liu, Young J. Juhn

Open Peer Review reports

Abstract

Background

Thus far, no algorithms have been developed to automatically extract patients who meet Asthma Predictive Index (API) criteria from the Electronic health records (EHR) yet. Our objective is to develop and validate a natural language processing (NLP) algorithm to identify patients that meet API criteria.

Methods

This is a cross-sectional study nested in a birth cohort study in Olmsted County, MN. Asthma status ascertained by manual chart review based on API criteria served as gold standard. NLP-API was developed on a training cohort (n = 87) and validated on a test cohort (n = 427). Criterion validity was measured by sensitivity, specificity, positive predictive value and negative predictive value of the NLP algorithm against manual chart review for asthma status. Construct validity was determined by associations of asthma status defined by NLP-API with known risk factors for asthma.

Results

Among the eligible 427 subjects of the test cohort, 48% were males and 74% were White. Median age was 5.3 years (interquartile range 3.6–6.8). 35 (8%) had a history of asthma by NLP-API vs. 36 (8%) by abstractor with 31 by both approaches. NLP-API predicted asthma status with sensitivity 86%, specificity 98%, positive predictive value 88%, negative predictive value 98%. Asthma status by both NLP and manual chart review were significantly associated with the known asthma risk factors, such as history of allergic rhinitis, eczema, family history of asthma, and maternal history of smoking during pregnancy (p value < 0.05). Maternal smoking [odds ratio: 4.4, 95% confidence interval 1.8–10.7] was associated with asthma status determined by NLP-API and abstractor, and the effect sizes were similar between the reviews with 4.4 vs 4.2 respectively.

Conclusion

NLP-API was able to ascertain asthma status in children mining from EHR and has a potential to enhance asthma care and research through population management and large-scale studies when identifying children who meet API criteria.

Download PDF

GAA Interasma

Interasma Global Asthma Association

Automated chart review utilizing natural language processing algorithm for asthma predictive index

Leave a Reply Cancel reply

Recent Posts

Tags

Categories

Recent Posts