References

Цифровая трансформация

Digital Transformation

2522-96132524-2822

Educational Establishment “Belarusian State University of Informatics and Radioelectronics”

10.38086/2522-9613-2019-1-43-48

dt-112

Research Article

ТЕХНИЧЕСКИЕ НАУКИ

TECHNICAL SCIENCES

Модель автоматической классификации и локализации образов

Model of Automatic Classification and Localization of Images

Серебряная

Л. В.

Serebryanaya

L. V.

Серебряная Лия Валентиновна, кандидат технических наук, доцент, доцент кафедры ПОИТ

ул. П. Бровки, д. 6, 220013, г. Минск

Candidate of Science (Technology), Associate Professor, Associate Professor of the Department "Software of Information Technologies"

6 P. Brovka Str., 220013 Minsk, Republic of Belarus

l_silver@mail.ru

Бочкарев

К. Ю.

Bochkarev

K. Y.

Бочкарев Кирилл Юрьевич, магистрант кафедры ИТАС

ул. П. Бровки, д. 6, 220013, г. Минск

Undergraduate Student of the Department "Information Technologies of Automated Systems"

6 P. Brovka Str., 220013 Minsk, Republic of Belarus

axe777@inbox.ru

Попитич

А. Я

Popitich

A. Y.

Попитич Александр Яковлевич, магистр технических наук

ул. П. Бровки, д. 6, 220013, г. Минск

Master of Technical Sciences

6 P. Brovka Str., 220013 Minsk, Republic of Belarus

sasha.popitich@outlook.com

УО «Белорусский государственный университет информатики и радиоэлектроники»Belarusian State University of Informatics and Radioelectronics

2019

05052019

014348

2019

Серебряная Л.В., Бочкарев К.Ю., Попитич А.Я.

Serebryanaya L.V., Bochkarev K.Y., Popitich A.Y.

Данная работа распространяется под лицензией Creative Commons Attribution 4.0.

This work is licensed under a Creative Commons Attribution 4.0 License.

https://dt.bsuir.by/jour/article/view/112

Работа посвящена идентификации образов на изображениях, которая выполняется в результате процедур классификации и локализации. Анализ моделей, методов и алгоритмов показал, что для решения поставленной задачи предпочтительно применять машинное обучение, искусственную нейронную сеть и генетический алгоритм. Предложена архитектура сверточной искусственной нейронной сети, позволяющая решать как задачу классификации, так и задачу локализации образов. Сначала сеть обучается, затем для изображения, подаваемого на ее вход, определяется класс. На заключительном этапе работы сверточной нейронной сети выполняется локализация объектов на изображении. Для этого анализируются выходные значения предпоследнего слоя модели, после чего происходит обход слоев в обратном порядке. Его цель – нахождение на исходном изображении регионов с наибольшим откликом. Комбинированная модель показала приемлемые результаты как по классификации, так и по локализации объектов. Все параметры для работы сети определяются автоматически с помощью генетического алгоритма. Дальнейшее улучшение работы предложенной модели связано с реализацией на ней распределенных вычислений.

The work is devoted to the identification of images in pictures, which is performed as a result of the classification and localization procedures. Analysis of models, methods and algorithms has shown that for solving the set task it is preferable to use machine learning, an artificial neural network and a genetic algorithm. The architecture of a convolutional artificial neural network is proposed. It can solve both the problem of classification and the problem of localizing images. First the network is trained, then a class is determined for the image fed to its input. Objects are localized in the image at the final stage of operations of the convolutional neural network. For this, the output values of the penultimate layer of the model are analyzed, after which the layers are traversed in the reverse order. Its goal is to find the regions with the highest response on the source image. The combined model showed acceptable results both in classification and in localization of objects. All parameters for the network are determined automatically using a genetic algorithm. Further improvement of the proposed model results will be performed by implementing distributed computing on it.

идентификацияклассификациялокализациямодель искусственной нейронной сетигенетический алгоритм.

identificationclassificationlocalizationmodel of artificial neural networkgenetic algorithm

References1

Radcliffe, N. J. Genetic set recombination and its application to neural network topology optimization. Technical Report EPCC–TR–91–21 / N. J. Radcliffe. – Edinburgh: University of Edinburgh, 1991. – 250 p.

Radcliffe, N. J. Genetic set recombination and its application to neural network topology optimization. Technical Report EPCC–TR–91–21. Edinburgh: University of Edinburgh, 1991. 250 p.

Stanley, K. О. Evolving Neural Topologies through Augmenting Topologies / K. О. Stanley, R. Miikkulainen // Evolutionary Computation. The MIT Press. – 2002. – Vol. 10 (2). – РP. 99–127.

Stanley K. О. Miikkulainen R. Evolving Neural Topologies through Augmenting Topologies. Evolutionary Computation. The MIT Press, 2002, Vol. 10 (2), pp. 99–127.

Simonyan, K. Deep inside convolutional networks: Visualising image classification models and saliency maps [Electronic resource] / K. Simonyan, A. Vedaldi // International Conference on Learning Representations Workshop. – 2014. – Mode of access: https://arxiv.org/pdf/1312.6034.pdf. – Date of access: 16.03.2019.

Simonyan K., Vedaldi A. Deep inside convolutional networks: Visualising image classification models and saliency maps. International Conference on Learning Representations Workshop. Available at: https://arxiv.org/pdf/1312.6034.pdf (accessed: 16.03.2019).

Perez, S. Apply genetic algorithm to the learning phase of a neural network [Electronic resource] / S. Perez // Department of Mechanical and Aerospace Engineering University of California. – Irvine, 2005. – Mode of access: https://pdfs.semanticscholar.org/cc48/1cf3f2dfa88fc5fa84cd41d7e9f7f7de4ff2.pdf. – Date of access: 16.03.2019.

Perez S. Apply genetic algorithm to the learning phase of a neural network. Department of Mechanical and Aerospace Engineering University of California, Irvine, 2005. Available at: https://pdfs.semanticscholar.org/cc48/1cf3f2dfa88fc5fa84cd41d7e9f7f7de4ff2.pdf (accessed: 16.03.2019).

Zhou, B. Learning Deep Features for Discriminative Localization [Electronic resource] / B. Zhou, A. Lapedriza // Computer Science and Artificial Intelligence Laboratory. – MIT, 2014. – Mode of access: https://arxiv.org/pdf/1310.1531.pdf. – Date of access: 16.03.2019.

Zhou. B., Lapedriza A. Learning Deep Features for Discriminative Localization. Computer Science and Artificial Intelligence Laboratory, MIT, 2014. Available at: https://arxiv.org/pdf/1310.1531.pdf (accessed: 16.03.2019).

Donahue, J. Decaf: A deep convolutional activation feature for generic visual recognition [Electronic resource] / J. Donahue, Y. Jia, O. Vinyals, J. Hoffman, N. Zhang, E. Tzeng, T. Darrell // International Conference on Machine Learning. – 2014. – Mode of access: https://web.njit.edu/~usman/courses/cs698_spring18/RCNN.pdf. – Date of access: 16.03.2019.

Donahue J., Jia Y., Vinyals O., Hoffman J., Zhang N., Tzeng E., Darrell T. Decaf: A deep convolutional activation feature for generic visual recognition. International Conference on Machine Learning, 2014. Available at: https://web.njit.edu/~usman/courses/cs698_spring18/RCNN.pdf (accessed: 16.03.2019).

Girshick, R. Rich feature hierarchies for accurate object detection and semantic segmentation [Electronic resource] / R. Girshick, J. Donahue, T. Darrell, J. Malik. – CVPR, 2014. – Mode of access: https://papers.nips.cc/paper/4824-imagenetclassification-with-deep-convolutional-neural-networks.pdf. – Date of access: 16.03.2019.

Girshick R., Donahue J., Darrell T., Malik J. Rich feature hierarchies for accurate object detection and semantic segmentation, CVPR, 2014. Available at: https://papers.nips.cc/paper/4824-imagenet-classification-with-deep-convolutional-neuralnetworks.pdf (accessed: 16.03.2019).

Krizhevsky, A. Imagenet classification with deep convolutional neural networks / A. Krizhevsky, I. Sutskever, G. E. Hinton // Advances in Neural Information Processing Systems. – 2012. – РP. 1097-1105.

Krizhevsky A., Sutskever I., Hinton G. E. Imagenet classification with deep convolutional neural networks. Advances in Neural Information Processing Systems, 2012, pp. 1097-1105.

The authors declare that there are no conflicts of interest present.