<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.3 20210610//EN" "JATS-journalpublishing1-3.dtd">
<article article-type="research-article" dtd-version="1.3" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xml:lang="ru"><front><journal-meta><journal-id journal-id-type="publisher-id">dt</journal-id><journal-title-group><journal-title xml:lang="ru">Цифровая трансформация</journal-title><trans-title-group xml:lang="en"><trans-title>Digital Transformation</trans-title></trans-title-group></journal-title-group><issn pub-type="ppub">2522-9613</issn><issn pub-type="epub">2524-2822</issn><publisher><publisher-name>Educational Establishment “Belarusian State University of Informatics and Radioelectronics”</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.38086/2522-9613-2019-1-43-48</article-id><article-id custom-type="elpub" pub-id-type="custom">dt-112</article-id><article-categories><subj-group subj-group-type="heading"><subject>Research Article</subject></subj-group><subj-group subj-group-type="section-heading" xml:lang="ru"><subject>ТЕХНИЧЕСКИЕ НАУКИ</subject></subj-group><subj-group subj-group-type="section-heading" xml:lang="en"><subject>TECHNICAL SCIENCES</subject></subj-group></article-categories><title-group><article-title>Модель автоматической классификации и локализации образов</article-title><trans-title-group xml:lang="en"><trans-title>Model of Automatic Classification and Localization of Images</trans-title></trans-title-group></title-group><contrib-group><contrib contrib-type="author" corresp="yes"><name-alternatives><name name-style="eastern" xml:lang="ru"><surname>Серебряная</surname><given-names>Л. В.</given-names></name><name name-style="western" xml:lang="en"><surname>Serebryanaya</surname><given-names>L. V.</given-names></name></name-alternatives><bio xml:lang="ru"><p>Серебряная Лия Валентиновна, кандидат технических наук, доцент, доцент кафедры ПОИТ </p><p>ул. П. Бровки, д. 6, 220013, г. Минск</p></bio><bio xml:lang="en"><p>Candidate of Science (Technology), Associate Professor, Associate Professor of the Department "Software of Information Technologies" </p><p>6 P. Brovka Str., 220013 Minsk, Republic of Belarus</p><p> </p></bio><email xlink:type="simple">l_silver@mail.ru</email><xref ref-type="aff" rid="aff-1"/></contrib><contrib contrib-type="author" corresp="yes"><name-alternatives><name name-style="eastern" xml:lang="ru"><surname>Бочкарев</surname><given-names>К. Ю.</given-names></name><name name-style="western" xml:lang="en"><surname>Bochkarev</surname><given-names>K. Y.</given-names></name></name-alternatives><bio xml:lang="ru"><p>Бочкарев Кирилл Юрьевич, магистрант кафедры ИТАС </p><p>ул. П. Бровки, д. 6, 220013, г. Минск</p></bio><bio xml:lang="en"><p>Undergraduate Student of the Department "Information Technologies of Automated Systems" </p><p>6 P. Brovka Str., 220013 Minsk, Republic of Belarus</p><p> </p></bio><email xlink:type="simple">axe777@inbox.ru</email><xref ref-type="aff" rid="aff-1"/></contrib><contrib contrib-type="author" corresp="yes"><name-alternatives><name name-style="eastern" xml:lang="ru"><surname>Попитич</surname><given-names>А. Я</given-names></name><name name-style="western" xml:lang="en"><surname>Popitich</surname><given-names>A. Y.</given-names></name></name-alternatives><bio xml:lang="ru"><p>Попитич Александр Яковлевич, магистр технических наук </p><p>ул. П. Бровки, д. 6, 220013, г. Минск</p></bio><bio xml:lang="en"><p>Master of Technical Sciences </p><p> </p><p>6 P. Brovka Str., 220013 Minsk, Republic of Belarus</p></bio><email xlink:type="simple">sasha.popitich@outlook.com</email><xref ref-type="aff" rid="aff-1"/></contrib></contrib-group><aff-alternatives id="aff-1"><aff xml:lang="ru"><institution>УО «Белорусский государственный университет информатики и радиоэлектроники»</institution></aff><aff xml:lang="en"><institution>Belarusian State University of Informatics and Radioelectronics</institution></aff></aff-alternatives><pub-date pub-type="collection"><year>2019</year></pub-date><pub-date pub-type="epub"><day>05</day><month>05</month><year>2019</year></pub-date><volume>0</volume><issue>1</issue><fpage>43</fpage><lpage>48</lpage><permissions><copyright-statement>Copyright &amp;#x00A9; Серебряная Л.В., Бочкарев К.Ю., Попитич А.Я., 2019</copyright-statement><copyright-year>2019</copyright-year><copyright-holder xml:lang="ru">Серебряная Л.В., Бочкарев К.Ю., Попитич А.Я.</copyright-holder><copyright-holder xml:lang="en">Serebryanaya L.V., Bochkarev K.Y., Popitich A.Y.</copyright-holder><license xml:lang="ru" license-type="creative-commons-attribution" xlink:href="https://creativecommons.org/licenses/by/4.0/" xlink:type="simple"><license-p>Данная работа распространяется под лицензией Creative Commons Attribution 4.0.</license-p></license><license xml:lang="en" license-type="creative-commons-attribution" xlink:href="https://creativecommons.org/licenses/by/4.0/" xlink:type="simple"><license-p>This work is licensed under a Creative Commons Attribution 4.0 License.</license-p></license></permissions><self-uri xlink:href="https://dt.bsuir.by/jour/article/view/112">https://dt.bsuir.by/jour/article/view/112</self-uri><abstract><p>Работа посвящена идентификации образов на изображениях, которая выполняется в результате процедур классификации и локализации. Анализ моделей, методов и алгоритмов показал, что для решения поставленной задачи предпочтительно применять машинное обучение, искусственную нейронную сеть и генетический алгоритм. Предложена архитектура сверточной искусственной нейронной сети, позволяющая решать как задачу классификации, так и задачу локализации образов. Сначала сеть обучается, затем для изображения, подаваемого на ее вход, определяется класс. На заключительном этапе работы сверточной нейронной сети выполняется локализация объектов на изображении. Для этого анализируются выходные значения предпоследнего слоя модели, после чего происходит обход слоев в обратном порядке. Его цель – нахождение на исходном изображении регионов с наибольшим откликом. Комбинированная модель показала приемлемые результаты как по классификации, так и по локализации объектов. Все параметры для работы сети определяются автоматически с помощью генетического алгоритма. Дальнейшее улучшение работы предложенной модели связано с реализацией на ней распределенных вычислений.</p></abstract><trans-abstract xml:lang="en"><p>The work is devoted to the identification of images in pictures, which is performed as a result of the classification and localization procedures. Analysis of models, methods and algorithms has shown that for solving the set task it is preferable to use machine learning, an artificial neural network and a genetic algorithm. The architecture of a convolutional artificial neural network is proposed. It can solve both the problem of classification and the problem of localizing images. First the network is trained, then a class is determined for the image fed to its input. Objects are localized in the image at the final stage of operations of the convolutional neural network. For this, the output values of the penultimate layer of the model are analyzed, after which the layers are traversed in the reverse order. Its goal is to find the regions with the highest response on the source image. The combined model showed acceptable results both in classification and in localization of objects. All parameters for the network are determined automatically using a genetic algorithm. Further improvement of the proposed model results will be performed by implementing distributed computing on it.</p></trans-abstract><kwd-group xml:lang="ru"><kwd>идентификация</kwd><kwd>классификация</kwd><kwd>локализация</kwd><kwd>модель искусственной нейронной сети</kwd><kwd>генетический алгоритм.</kwd></kwd-group><kwd-group xml:lang="en"><kwd>identification</kwd><kwd>classification</kwd><kwd>localization</kwd><kwd>model of artificial neural network</kwd><kwd>genetic algorithm</kwd></kwd-group></article-meta></front><back><ref-list><title>References</title><ref id="cit1"><label>1</label><citation-alternatives><mixed-citation xml:lang="ru">Radcliffe, N. J. Genetic set recombination and its application to neural network topology optimization. Technical Report EPCC–TR–91–21 / N. J. Radcliffe. – Edinburgh: University of Edinburgh, 1991. – 250 p.</mixed-citation><mixed-citation xml:lang="en">Radcliffe, N. J. Genetic set recombination and its application to neural network topology optimization. Technical Report EPCC–TR–91–21. Edinburgh: University of Edinburgh, 1991. 250 p.</mixed-citation></citation-alternatives></ref><ref id="cit2"><label>2</label><citation-alternatives><mixed-citation xml:lang="ru">Stanley, K. О. Evolving Neural Topologies through Augmenting Topologies / K. О. Stanley, R. Miikkulainen // Evolutionary Computation. The MIT Press. – 2002. – Vol. 10 (2). – РP. 99–127.</mixed-citation><mixed-citation xml:lang="en">Stanley K. О. Miikkulainen R. Evolving Neural Topologies through Augmenting Topologies. Evolutionary Computation. The MIT Press, 2002, Vol. 10 (2), pp. 99–127.</mixed-citation></citation-alternatives></ref><ref id="cit3"><label>3</label><citation-alternatives><mixed-citation xml:lang="ru">Simonyan, K. Deep inside convolutional networks: Visualising image classification models and saliency maps [Electronic resource] / K. Simonyan, A. Vedaldi // International Conference on Learning Representations Workshop. – 2014. – Mode of access: https://arxiv.org/pdf/1312.6034.pdf. – Date of access: 16.03.2019.</mixed-citation><mixed-citation xml:lang="en">Simonyan K., Vedaldi A. Deep inside convolutional networks: Visualising image classification models and saliency maps. International Conference on Learning Representations Workshop. Available at: https://arxiv.org/pdf/1312.6034.pdf (accessed: 16.03.2019).</mixed-citation></citation-alternatives></ref><ref id="cit4"><label>4</label><citation-alternatives><mixed-citation xml:lang="ru">Perez, S. Apply genetic algorithm to the learning phase of a neural network [Electronic resource] / S. Perez // Department of Mechanical and Aerospace Engineering University of California. – Irvine, 2005. – Mode of access: https://pdfs.semanticscholar.org/cc48/1cf3f2dfa88fc5fa84cd41d7e9f7f7de4ff2.pdf. – Date of access: 16.03.2019.</mixed-citation><mixed-citation xml:lang="en">Perez S. Apply genetic algorithm to the learning phase of a neural network. Department of Mechanical and Aerospace Engineering University of California, Irvine, 2005. Available at: https://pdfs.semanticscholar.org/cc48/1cf3f2dfa88fc5fa84cd41d7e9f7f7de4ff2.pdf (accessed: 16.03.2019).</mixed-citation></citation-alternatives></ref><ref id="cit5"><label>5</label><citation-alternatives><mixed-citation xml:lang="ru">Zhou, B. Learning Deep Features for Discriminative Localization [Electronic resource] / B. Zhou, A. Lapedriza // Computer Science and Artificial Intelligence Laboratory. – MIT, 2014. – Mode of access: https://arxiv.org/pdf/1310.1531.pdf. – Date of access: 16.03.2019.</mixed-citation><mixed-citation xml:lang="en">Zhou. B., Lapedriza A. Learning Deep Features for Discriminative Localization. Computer Science and Artificial Intelligence Laboratory, MIT, 2014. Available at: https://arxiv.org/pdf/1310.1531.pdf (accessed: 16.03.2019).</mixed-citation></citation-alternatives></ref><ref id="cit6"><label>6</label><citation-alternatives><mixed-citation xml:lang="ru">Donahue, J. Decaf: A deep convolutional activation feature for generic visual recognition [Electronic resource] / J. Donahue, Y. Jia, O. Vinyals, J. Hoffman, N. Zhang, E. Tzeng, T. Darrell // International Conference on Machine Learning. – 2014. – Mode of access: https://web.njit.edu/~usman/courses/cs698_spring18/RCNN.pdf. – Date of access: 16.03.2019.</mixed-citation><mixed-citation xml:lang="en">Donahue J., Jia Y., Vinyals O., Hoffman J., Zhang N., Tzeng E., Darrell T. Decaf: A deep convolutional activation feature for generic visual recognition. International Conference on Machine Learning, 2014. Available at: https://web.njit.edu/~usman/courses/cs698_spring18/RCNN.pdf (accessed: 16.03.2019).</mixed-citation></citation-alternatives></ref><ref id="cit7"><label>7</label><citation-alternatives><mixed-citation xml:lang="ru">Girshick, R. Rich feature hierarchies for accurate object detection and semantic segmentation [Electronic resource] / R. Girshick, J. Donahue, T. Darrell, J. Malik. – CVPR, 2014. – Mode of access: https://papers.nips.cc/paper/4824-imagenetclassification-with-deep-convolutional-neural-networks.pdf. – Date of access: 16.03.2019.</mixed-citation><mixed-citation xml:lang="en">Girshick R., Donahue J., Darrell T., Malik J. Rich feature hierarchies for accurate object detection and semantic segmentation, CVPR, 2014. Available at: https://papers.nips.cc/paper/4824-imagenet-classification-with-deep-convolutional-neuralnetworks.pdf (accessed: 16.03.2019).</mixed-citation></citation-alternatives></ref><ref id="cit8"><label>8</label><citation-alternatives><mixed-citation xml:lang="ru">Krizhevsky, A. Imagenet classification with deep convolutional neural networks / A. Krizhevsky, I. Sutskever, G. E. Hinton // Advances in Neural Information Processing Systems. – 2012. – РP. 1097-1105.</mixed-citation><mixed-citation xml:lang="en">Krizhevsky A., Sutskever I., Hinton G. E. Imagenet classification with deep convolutional neural networks. Advances in Neural Information Processing Systems, 2012, pp. 1097-1105.</mixed-citation></citation-alternatives></ref></ref-list><fn-group><fn fn-type="conflict"><p>The authors declare that there are no conflicts of interest present.</p></fn></fn-group></back></article>
