HUJJATLARDAN JADVALLARNI CHIQARIB OLISH MASALASI, USULLARI VA DASTURIY TA’MINOTLAR TAHLILI
Keywords:
Text Mining, Table Mining, Jadval modeli, ma’lumotlar bazasi, NLPAbstract
Hozirgi kunda matnli elektron hujjatlarni bir maqsad yo’lida qayta ishlash, jumladan, hujjat matnini mavzusiga ko’ra tasniflash, NLP algoritimlari, kalit so’zlar yordamida zarur axborot birliklarini chiqarib olish borasida juda ham ko’plagan ilmiy amaliy tadqiqot ishlari olib borilgan va bu davom etmoqda. Natijada matnlarni qayta ishlash va zarur axborotlarni chiqarib olishga qaratilgan Text Mining, sun’iy intelekt va ML vositalari asosida dasturiy ta’minotlar amaliyotga joriy etilmoqda. Mazkur maqola hujjatlardan jadvallarni ajratib va undagi ma’lumotlarni chiqarib olish borasidagi ilmiy-amaliy tadqiqotlar tahlil qilinadi.
References
Foydalangan adabiyotlar:
Tupaj S., Shi Z., Chang C.H., Alam H. Extracting Tabular Information From Text Files. 1996. http://citeseer.nj.nec.com
Pinto D., McCallum A., Wei X., Croft B. Table extraction using, conditional random fields // 26th Annual Intern. ACM SIGIR, Conf. on Research and Development in Information Retrieval, 2003
Хмельнов А. Е., Шигаров А. О. Метод извлечения таблиц из неформатированного текста // Вычислительные технологии. Том 13, Специальный выпуск 1, 2008. Стр. 93-101
Wright, P., Fox, K.: Presenting information in tables. Appl. Ergon.1(4), 234–242 (1970)
N. Milosevic et al., A framework for information extraction from tables in biomedical literature. IJDAR. 2019. 22:55–78 https://doi.org/10.1007/s10032-019-00317-0
Long, V.: An agent-based approach to table recognition and interpretation. Ph.D. thesis, Macquarie University Sydney, Australia (2010)
Hurst, M.F.: The interpretation of tables in texts. Ph.D. thesis (2000)
Yildiz, B., Kaiser, K., Miksch, S.: pdf2table: a method to extracttable information from pdf files. In: IICAI, pp. 1773–1785 (2005)
Son, J.-W., Lee, J.-A., Park, S.-B., Song, H.-J., Lee, S.-J., Park, S.Y.: Discriminating meaningful web tables from decorative tables using a composite kernel. In: IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology, 2008. WI-IAT’08, vol. 1, pp. 368–371. IEEE (2008)
Silva, A.: Parts that add up to a whole: a framework for the analysisof tables. Ph.D. thesis, University of Edinburgh (2010)
Hearst, M.A., Divoli, A., Guturu, H., Ksikes, A., Nakov, P.,Wooldridge, M.A., Ye, J.: Biotext search engine: beyond abstract search. Bioinformatics 23(16), 2196–2197 (2007)
Liu, Y.: Tableseer: automatic table extraction, search, and understanding. Ph.D. thesis, The Pennsylvania State University (2009)
Chen, H.-H., Tsai, S.-C., Tsai, J.-H.: Mining tables from largescale HTML texts. In: Proceedings of the 18th Conference on Computational Linguistics, vol. 1, pp. 166–172. Association for Computational Linguistics (2000)
Wei, X., Croft, B., McCallum, A.: Table extraction for answerretrieval. Inf. Retr. 9(5), 589–611 (2006)
A.Shigarov Table understanding using a rule engine // Expert Systems with Applications 42(2):929–937. February 2015.DOI:10.1016/j.eswa.2014.08.045
Шигаров А.О., Бычков И.В.Анализ и интерпретация произвольных таблиц на основе исполнения CRL-правил // Вычислительные технологии Том 20, № 6, 2015. Стр 87-112
Wang, X. Tabular abstraction, editing, and formatting: PhD thesis. Waterloo, Ontario, Canada, University of Waterloo, 1996.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2023 Nishanov Axram Hasanovich, Kenjayev Xamdam Bazarbayevich
This work is licensed under a Creative Commons Attribution 4.0 International License.