닫기
216.73.216.214
216.73.216.214
close menu
KCI 등재
자연어 처리 기반 『傷寒論』 辨病診斷體系 분류를 위한 기계학습 모델 선정
Selecting Machine Learning Model Based on Natural Language Processing for Shanghanlun Diagnostic System Classification
김영남 ( Young-nam Kim )
UCI I410-ECN-0102-2023-500-001184182

Objective : The purpose of this study is to explore the most suitable machine learning model algorithm for Shanghanlun diagnostic system classification using natural language processing (NLP). Methods : A total of 201 data items were collected from 『Shanghanlun』and 『Clinical Shanghanlun』, ‘Taeyangbyeong-gyeolhyung’ and ‘Eumyangyeokchahunobokbyeong’ were excluded to prevent oversampling or undersampling. Data were pretreated using a twitter Korean tokenizer and trained by logistic regression, ridge regression, lasso regression, naive bayes classifier, decision tree, and random forest algorithms. The accuracy of the models were compared. Results : As a result of machine learning, ridge regression and naive Bayes classifier showed an accuracy of 0.843, logistic regression and random forest showed an accuracy of 0.804, and decision tree showed an accuracy of 0.745, while lasso regression showed an accuracy of 0.608. Conclusions : Ridge regression and naive Bayes classifier are suitable NLP machine learning models for the Shanghanlun diagnostic system classification.

서 론
방 법
결 과
고 찰
결 론
감사의 글
Reference
[자료제공 : 네이버학술정보]
×