Bidirectional Transformer for Android Based Image Icon Text Generation
- DOI
- 10.2991/978-94-6463-040-4_95How to use a DOI?
- Keywords
- Image Caption; Natural Language Processing; Encoder-Decoder Framework
- Abstract
The image caption task is an important manifestation of the fusion of computer vision and natural language processing development in deep learning. The Image Caption task, which is an advanced kind of image comprehension, can effectively grasp image information and produce accurate and concise natural language descriptions to users. It has gotten a lot of attention in the subject of art intelligence, and it has a lot of uses in the field of assisting visually impaired guides and human-computer interaction. This research primarily presents a deep learning-based solution for completing the natural language generation task based on images and symbols in Android. The encoder-decoder framework is used as the core structure to help visually impaired persons interact with mobile phones.
- Copyright
- © 2023 The Author(s)
- Open Access
- Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.
Cite this article
TY - CONF AU - Chuyi Yu AU - Ying Ma AU - Jianmin Li PY - 2022 DA - 2022/12/27 TI - Bidirectional Transformer for Android Based Image Icon Text Generation BT - Proceedings of the 2022 3rd International Conference on Artificial Intelligence and Education (IC-ICAIE 2022) PB - Atlantis Press SP - 627 EP - 632 SN - 2589-4900 UR - https://doi.org/10.2991/978-94-6463-040-4_95 DO - 10.2991/978-94-6463-040-4_95 ID - Yu2022 ER -