Data Quality Assessment and Improvement for Water Level Prediction of the Han River

Ji-hyun Choi; Jin-yeop Kang; Hyun Ahn

KCI-indexed

Data Quality Assessment and Improvement for Water Level Prediction of the Han River

Ji-hyun Choi (최지현), Jin-yeop Kang (강진엽), Hyun Ahn (안현)

Korea Navigation Institute (한국항행학회), February 2023

Journal of Advanced Navigation Technology (한국항행학회논문지), Vol. 27, No. 1, pp. 133-138 (6 pages)

UCI I410-ECN-0102-2023-500-001145718

Abstract

Flood disasters are increasing in frequency and in the scale of damage worldwide as a side effect of recent rapid climate change and global warming. In Korea, the water level of the Han River is a key management target for preventing flood disasters in Seoul, the capital. To improve machine-learning-based prediction of the Han River water level, this paper comprehensively assesses the quality of the related dataset and proposes preprocessing methods to improve it. Specifically, missing-value processing and cross-correlation analysis are used to improve the dataset in terms of completeness, validity, and accuracy. To analyze how the proposed data improvement method affects water level prediction performance, a performance evaluation is conducted using random forest and LightGBM.
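The abstract names missing-value processing and cross-correlation analysis but does not say how either is carried out. The sketch below is therefore only an illustration under stated assumptions: gaps are filled by linear interpolation (an assumed choice, not confirmed as the authors' method), and cross-correlation is taken as the Pearson correlation between an input series and the water level shifted by a candidate lag. All function names and the toy rainfall and water-level values are hypothetical.

```python
import math


def interpolate_missing(values):
    """Fill None gaps by linear interpolation between the nearest observed
    neighbours; leading/trailing gaps take the nearest observed value.
    (Linear interpolation is an assumption, not the paper's stated method.)"""
    known = [i for i, v in enumerate(values) if v is not None]
    if not known:
        raise ValueError("series has no observed values")
    filled = list(values)
    for i, v in enumerate(values):
        if v is not None:
            continue
        left = max((k for k in known if k < i), default=None)
        right = min((k for k in known if k > i), default=None)
        if left is None:
            filled[i] = values[right]
        elif right is None:
            filled[i] = values[left]
        else:
            weight = (i - left) / (right - left)
            filled[i] = values[left] + weight * (values[right] - values[left])
    return filled


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    if sd_x == 0 or sd_y == 0:
        return 0.0  # a constant series carries no correlation information
    return cov / (sd_x * sd_y)


def best_lag(driver, target, max_lag):
    """Return (lag, r) maximising the correlation of driver[t] with target[t + lag]."""
    scores = {}
    for lag in range(max_lag + 1):
        x = driver[: len(driver) - lag]
        y = target[lag:]
        scores[lag] = pearson(x, y)
    lag = max(scores, key=scores.get)
    return lag, scores[lag]


# Hypothetical toy data: hourly rainfall with one missing reading, and a
# water level that responds to rainfall two time steps later.
rainfall = [0, 0, 5, None, 9, 4, 1, 0, 0, 0]
water_level = [1.0, 1.0, 1.0, 1.0, 1.5, 1.7, 1.9, 1.4, 1.1, 1.0]

filled_rain = interpolate_missing(rainfall)
lag, r = best_lag(filled_rain, water_level, max_lag=4)
print(f"filled rainfall: {filled_rain}")
print(f"best lag: {lag} steps, correlation: {r:.3f}")
```

In a pipeline of this kind, the lag with the strongest correlation could be used to shift an input series before it is passed to a model such as random forest or LightGBM; whether the paper uses the analysis in exactly this way is not stated in the abstract.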

Keywords

Data preprocessing

Data quality assessment

LightGBM

Random forest

Water-level prediction

Ⅰ. Introduction
Ⅱ. Related Work
Ⅲ. Data Quality Assessment
Ⅳ. Data Preprocessing and Improvement
Ⅴ. Experiments and Results
Ⅵ. Conclusion
Acknowledgments
References
