HUJJATLARDAN JADVALLARNI CHIQARIB OLISH MASALASI, USULLARI VA DASTURIY TA’MINOTLAR TAHLILI

Authors

  • Nishanov Axram Hasanovich Muhammad al-Xorazmiy nomidagi TATU
  • Kenjayev Xamdam Bazarbayevich Muhammad al-Xorazmiy nomidagi TATU

Keywords:

Text Mining, Table Mining, Jadval modeli, ma’lumotlar bazasi, NLP

Abstract

Hozirgi kunda matnli elektron hujjatlarni bir maqsad yo’lida qayta ishlash, jumladan, hujjat matnini mavzusiga ko’ra tasniflash, NLP algoritimlari, kalit so’zlar yordamida zarur axborot birliklarini chiqarib olish borasida juda ham ko’plagan ilmiy amaliy tadqiqot ishlari olib borilgan va bu davom etmoqda. Natijada matnlarni qayta ishlash va zarur axborotlarni chiqarib olishga qaratilgan Text Mining, sun’iy intelekt va ML vositalari asosida dasturiy ta’minotlar amaliyotga joriy etilmoqda. Mazkur maqola hujjatlardan jadvallarni ajratib va undagi ma’lumotlarni chiqarib olish borasidagi ilmiy-amaliy tadqiqotlar tahlil qilinadi.

Author Biography

Kenjayev Xamdam Bazarbayevich, Muhammad al-Xorazmiy nomidagi TATU

Tizimli va amaliy dasturlashtirish kafedrasi t.f.d., professor.

References

Foydalangan adabiyotlar:

Tupaj S., Shi Z., Chang C.H., Alam H. Extracting Tabular Information From Text Files. 1996. http://citeseer.nj.nec.com

Pinto D., McCallum A., Wei X., Croft B. Table extraction using, conditional random fields // 26th Annual Intern. ACM SIGIR, Conf. on Research and Development in Information Retrieval, 2003

Хмельнов А. Е., Шигаров А. О. Метод извлечения таблиц из неформатированного текста // Вычислительные технологии. Том 13, Специальный выпуск 1, 2008. Стр. 93-101

Wright, P., Fox, K.: Presenting information in tables. Appl. Ergon.1(4), 234–242 (1970)

N. Milosevic et al., A framework for information extraction from tables in biomedical literature. IJDAR. 2019. 22:55–78 https://doi.org/10.1007/s10032-019-00317-0

Long, V.: An agent-based approach to table recognition and interpretation. Ph.D. thesis, Macquarie University Sydney, Australia (2010)

Hurst, M.F.: The interpretation of tables in texts. Ph.D. thesis (2000)

Yildiz, B., Kaiser, K., Miksch, S.: pdf2table: a method to extracttable information from pdf files. In: IICAI, pp. 1773–1785 (2005)

Son, J.-W., Lee, J.-A., Park, S.-B., Song, H.-J., Lee, S.-J., Park, S.Y.: Discriminating meaningful web tables from decorative tables using a composite kernel. In: IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology, 2008. WI-IAT’08, vol. 1, pp. 368–371. IEEE (2008)

Silva, A.: Parts that add up to a whole: a framework for the analysisof tables. Ph.D. thesis, University of Edinburgh (2010)

Hearst, M.A., Divoli, A., Guturu, H., Ksikes, A., Nakov, P.,Wooldridge, M.A., Ye, J.: Biotext search engine: beyond abstract search. Bioinformatics 23(16), 2196–2197 (2007)

Liu, Y.: Tableseer: automatic table extraction, search, and understanding. Ph.D. thesis, The Pennsylvania State University (2009)

Chen, H.-H., Tsai, S.-C., Tsai, J.-H.: Mining tables from largescale HTML texts. In: Proceedings of the 18th Conference on Computational Linguistics, vol. 1, pp. 166–172. Association for Computational Linguistics (2000)

Wei, X., Croft, B., McCallum, A.: Table extraction for answerretrieval. Inf. Retr. 9(5), 589–611 (2006)

A.Shigarov Table understanding using a rule engine // Expert Systems with Applications 42(2):929–937. February 2015.DOI:10.1016/j.eswa.2014.08.045

Шигаров А.О., Бычков И.В.Анализ и интерпретация произвольных таблиц на основе исполнения CRL-правил // Вычислительные технологии Том 20, № 6, 2015. Стр 87-112

Wang, X. Tabular abstraction, editing, and formatting: PhD thesis. Waterloo, Ontario, Canada, University of Waterloo, 1996.

Downloads

Published

2023-08-28

How to Cite

Kenjayev, X., & Nishanov , A. (2023). HUJJATLARDAN JADVALLARNI CHIQARIB OLISH MASALASI, USULLARI VA DASTURIY TA’MINOTLAR TAHLILI. DIGITAL TRANSFORMATION AND ARTIFICIAL INTELLIGENCE, 1(2), 148–157. Retrieved from https://dtai.tsue.uz/index.php/dtai/article/view/v1i224