<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.3 20210610//EN" "JATS-journalpublishing1-3.dtd">
<article article-type="research-article" dtd-version="1.3" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xml:lang="ru"><front><journal-meta><journal-id journal-id-type="publisher-id">dt</journal-id><journal-title-group><journal-title xml:lang="ru">Цифровая трансформация</journal-title><trans-title-group xml:lang="en"><trans-title>Digital Transformation</trans-title></trans-title-group></journal-title-group><issn pub-type="ppub">2522-9613</issn><issn pub-type="epub">2524-2822</issn><publisher><publisher-name>Educational Establishment “Belarusian State University of Informatics and Radioelectronics”</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.38086/2522-9613-2019-2-46-52</article-id><article-id custom-type="elpub" pub-id-type="custom">dt-172</article-id><article-categories><subj-group subj-group-type="heading"><subject>Research Article</subject></subj-group><subj-group subj-group-type="section-heading" xml:lang="ru"><subject>ТЕХНИЧЕСКИЕ НАУКИ</subject></subj-group><subj-group subj-group-type="section-heading" xml:lang="en"><subject>TECHNICAL SCIENCES</subject></subj-group></article-categories><title-group><article-title>Интеллектуальный анализ текстовой информации в специализированных областях в системе электронного правительства</article-title><trans-title-group xml:lang="en"><trans-title>Intellectual Analysis of Textual Information in Domain Fields in the System of e-Government</trans-title></trans-title-group></title-group><contrib-group><contrib contrib-type="author" corresp="yes"><name-alternatives><name name-style="eastern" xml:lang="ru"><surname>Макаревич</surname><given-names>Т. И.</given-names></name><name name-style="western" xml:lang="en"><surname>Makarevich</surname><given-names>T. I.</given-names></name></name-alternatives><bio xml:lang="ru"><p>Магистр филологических наук, старший преподаватель кафедры английского языка гуманитарных специальностей факультета международных отношений БГУ , магистрант 1 курса специальности «Электронное правительство» факультета инновационной подготовки Академии управления при Президенте Республики Беларусь</p><p>ул. Ленинградская, д. 20, 220030, г. Минск; ул. Московская, д. 17, 220007, г. Минск</p></bio><bio xml:lang="en"><p>Master of Philological sciences, Senior Lecturer of the Department of English for Humanities, Faculty of International Relations, BSU;  1st year postgraduate student, specialty “e-Government”</p><p>20 Leningradskaya Str., 220030 Minsk;  17 Moskovskaya Str., 220007 Minsk, Republic of Belarus </p></bio><email xlink:type="simple">t_makarevich@mail.ru</email><xref ref-type="aff" rid="aff-1"/></contrib></contrib-group><aff-alternatives id="aff-1"><aff xml:lang="ru"><institution>Белорусский государственный университет; &#13;
Академия управления при Президенте Республики Беларусь</institution></aff><aff xml:lang="en"><institution>Belarusian State University; &#13;
Academy of Public Administration under the aegis of the President&#13;
of the Republic of Belaru</institution></aff></aff-alternatives><pub-date pub-type="collection"><year>2019</year></pub-date><pub-date pub-type="epub"><day>05</day><month>08</month><year>2019</year></pub-date><volume>0</volume><issue>2</issue><fpage>46</fpage><lpage>52</lpage><permissions><copyright-statement>Copyright &amp;#x00A9; Макаревич Т.И., 2019</copyright-statement><copyright-year>2019</copyright-year><copyright-holder xml:lang="ru">Макаревич Т.И.</copyright-holder><copyright-holder xml:lang="en">Makarevich T.I.</copyright-holder><license xml:lang="ru" license-type="creative-commons-attribution" xlink:href="https://creativecommons.org/licenses/by/4.0/" xlink:type="simple"><license-p>Данная работа распространяется под лицензией Creative Commons Attribution 4.0.</license-p></license><license xml:lang="en" license-type="creative-commons-attribution" xlink:href="https://creativecommons.org/licenses/by/4.0/" xlink:type="simple"><license-p>This work is licensed under a Creative Commons Attribution 4.0 License.</license-p></license></permissions><self-uri xlink:href="https://dt.bsuir.by/jour/article/view/172">https://dt.bsuir.by/jour/article/view/172</self-uri><abstract><p>Настоящая статья посвящена изучению применения технологии text mining в научных исследованиях как одного из методов интеллектуального анализа текстовой информации в специализированных областях системы электронного правительства. Значимость работы объясняется тем, что в настоящее время в Республике Беларусь не существует исследований, аналогичных проведенному. Продемонстрировано применение программного пакета Rapid Miner и языка R как сред для глубинного анализа текста. Оптимальной формой изучения предметных онтологий признано так называемое концептуальное индексирование. Обозначены оптимальные подходы к его рассмотрению: формальный и лингвистический. Выявлены проблемы избыточности и многозначности слов. Разработка данной проблематики нацелена на согласование разрозненности русскоязычных и иноязычных терминологических систем специализированных онтологий на основе технологий искусственного интеллекта.</p></abstract><trans-abstract xml:lang="en"><p>The given paper considers application of data mining technology in scientific research as one of intellectual analysis methods in the domain field of e-Government. The topicality of the issue is stipulated by the current absence of the researches of the kind in the Republic of Belarus. The paper illustrates how the programme package Rapid Miner and the language R have been applied in text mining. Concept indexing has been admitted as the most resultative form of analyzing domain field ontologies. Formal and linguistic approaches are found most effective in analyzing domain field ontologies. The paper identifies the problems of word redundancy and word polysemy. The prognosis for the further research investigation is in interconnectivity of specialized ontologies studying heterogeneous terms on the basis of artificial intelligence (AI).</p></trans-abstract><kwd-group xml:lang="ru"><kwd>терминологическая система</kwd><kwd>специализированные терминологические словари</kwd><kwd>информационно-поисковый тезаурус</kwd><kwd>онтология</kwd><kwd>предметная область</kwd><kwd>обработка текстовой информации</kwd><kwd>частотный анализ</kwd><kwd>глубинный анализ текста</kwd><kwd>язык R</kwd><kwd>Rapid Miner</kwd><kwd>электронное правительство</kwd></kwd-group><kwd-group xml:lang="en"><kwd>terminilogical system</kwd><kwd>specialized dictionaries</kwd><kwd>information retrieval thesaurus</kwd><kwd>ontology</kwd><kwd>domain area</kwd><kwd>text information processing</kwd><kwd>analysis in the frequency domain</kwd><kwd>text mining</kwd><kwd>the computer language R</kwd><kwd>Rapid Miner</kwd><kwd>e-Government</kwd></kwd-group></article-meta></front><back><ref-list><title>References</title><ref id="cit1"><label>1</label><citation-alternatives><mixed-citation xml:lang="ru">Добров, Б. В. Онтологии и тезаурусы: модели, инструменты, приложения / Б. В. Добров, В. В. Иванов, Н. В. Лукашевич, В. Д. Соловьев. – М.: Бином. Лаборатория знаний, 2009. – 173 с.</mixed-citation><mixed-citation xml:lang="en">Dobrov B. V. Ontologii i tezaurusy: modeli, instrumenty, prilozheniya [Ontologies and Thesauruses: Models, Instruments, Applications]. Мoscow, Binom. Laboratoriya znanij, 2009. 173 p. (in Russian).</mixed-citation></citation-alternatives></ref><ref id="cit2"><label>2</label><citation-alternatives><mixed-citation xml:lang="ru">Макаревич, Т. И. English for ICT Students = Английский язык для изучающих информационно-коммуникационные технологии: пособие: в 2-х ч. / Т. И. Макаревич, И. И. Макаревич. – Минск: Акад. упр. при Президенте Респ. Беларусь, 2012. – 382 с.</mixed-citation><mixed-citation xml:lang="en">Makarevich T. I., Makarevich I. I. English for ICT Students: textbook: in 2 parts. Minsk: Academy of Public Administration under the aegis of the President of the Republic of Belarus, 2012. 382 p.</mixed-citation></citation-alternatives></ref><ref id="cit3"><label>3</label><citation-alternatives><mixed-citation xml:lang="ru">Piatetsky-Shapiro, G. Knowledge Discovery in Databases / G. Piatetsky-Shapiro, W. Frawley. – New York: AAAI/MIT Press, 1991. – 168 p.</mixed-citation><mixed-citation xml:lang="en">Piatetsky-Shapiro G., Frawley W. Knowledge Discovery in Databases. NY: AAAI/MIT Press, 1991. 168 p.</mixed-citation></citation-alternatives></ref><ref id="cit4"><label>4</label><citation-alternatives><mixed-citation xml:lang="ru">Ландэ, Д. В. Подход к созданию терминологических онтологий / Д. В. Ландэ, А. А. Снарский // Онтология проектирования. – 2014. – № 2(12). – C. 83–91.</mixed-citation><mixed-citation xml:lang="en">Lande D. V. An Approach to Creating Terminological Ontologies. Ontologiya proektirovaniya [Ontology Project Development], 2014, № 2(12), pp. 83–91 (in Russian).</mixed-citation></citation-alternatives></ref><ref id="cit5"><label>5</label><citation-alternatives><mixed-citation xml:lang="ru">Hofmann, M. RapidMiner: Data Mining Use Cases and Business Analytics Applications / M. Hofmann, R. Klinkenberg. – New York: Chapman &amp; Hall/CRC Data Mining and Knowledge Discovery Series, 2013. – 525 p.</mixed-citation><mixed-citation xml:lang="en">Hofmann M., Klinkenberg R. RapidMiner: Data Mining Use Cases and Business Analytics Applications. Chapman &amp; Hall/CRC Data Mining and Knowledge Discovery Series. 1st ed, 2013. 525 p.</mixed-citation></citation-alternatives></ref><ref id="cit6"><label>6</label><citation-alternatives><mixed-citation xml:lang="ru">Тезаурус информационно-поисковый одноязычный: Правила разработки: структура, состав и форма представления // ГОСТ 7.25.-2001. Система стандартов по информации, библиотечному и издательскому делу. – Минск: Межгосударственный совет по стандартизации, метрологии и сертификации, 2001.</mixed-citation><mixed-citation xml:lang="en">Tezaurus informatsionno-poiskovyi odnoyazychnyi: Pravila razrabotki: sruktura, sostav I forma predstavleniya // GOST 7.25.- 2001. Sistema standartov po informatsii, bibliotechomu i izdatelskomu delu [Thesaurus Information-Retrieval Monolingual: Rules for Development: Structure, Content and Display Format // GOST 7.25.-2001. System of Standards in Information, Bibliography and Publishing]. Minsk, CIS Council for Standardization, Metrology and Certification Intergovernmental, 2001 (in Russian).</mixed-citation></citation-alternatives></ref><ref id="cit7"><label>7</label><citation-alternatives><mixed-citation xml:lang="ru">Guarino, N. Ontologies and Knowledge Bases: Towards a Terminological Clarification / N. Guarino, P. Giaretta // Towards Very Large Knowledge Bases: Knowledge Building and Knowledge Sharing. – Amsterdam: IOS Press, 1995. – P. 25–32.</mixed-citation><mixed-citation xml:lang="en">Guarino N., Giaretta P. Ontologies and Knowledge Bases: Towards a Terminological Clarification. Towards Very Large Knowledge Bases: Knowledge Building and Knowledge Sharing. Amsterdam, IOS Press, 1995, pp. 57–70.</mixed-citation></citation-alternatives></ref><ref id="cit8"><label>8</label><citation-alternatives><mixed-citation xml:lang="ru">Sowa, J. Knowledge Representation: Logical, Philosophical, and Computational Foundations / J. Sowa // Brooks Cole Publishing Co., Pacific Grove, CA. 2000. – V. 45(2). – P. 61–65.</mixed-citation><mixed-citation xml:lang="en">Sowa J. Knowledge Representation: Logical, Philosophical, and Computational Foundations. Brooks Cole Publishing Co., Pacific Grove, CA, 2000, V. 45(2), pp. 61 – 65.</mixed-citation></citation-alternatives></ref><ref id="cit9"><label>9</label><citation-alternatives><mixed-citation xml:lang="ru">Ihaka, R. R: A Language for Data Analysis and Graphics / R. Ihaka, R. Gentleman // Journal of Computational and Graphical Statistics. – 1996. – Vol. 5. – № 3. – P. 299–314.</mixed-citation><mixed-citation xml:lang="en">Ihaka R., Gentleman R. A Language for Data Analysis and Graphics. Journal of Computational and Graphical Statistics, 1996, Vol. 5, No 3, pp. 299–314.</mixed-citation></citation-alternatives></ref><ref id="cit10"><label>10</label><citation-alternatives><mixed-citation xml:lang="ru">Matloff, N. The Art of R Programming. A Tour of Statistical Software Design / N. Matloff. – San Francisco: No Starch Press. – 2011. – 316 p.</mixed-citation><mixed-citation xml:lang="en">Matloff N. The Art of R Programming. A Tour of Statistical Software Design. San Francisco, No Starch Press, 2011. 316 p.</mixed-citation></citation-alternatives></ref></ref-list><fn-group><fn fn-type="conflict"><p>The authors declare that there are no conflicts of interest present.</p></fn></fn-group></back></article>
