Language-Led Visual Grounding and Future Possibilities

Authors: Sui, Z., Zhou, M., Feng, Z., Stefanidis, A. and Jiang, N.

Journal: Electronics (Switzerland)

Volume: 12

Issue: 14

eISSN: 2079-9292

DOI: 10.3390/electronics12143142

Abstract:

With the rapid development of computer vision, the growing popularity of intelligent hardware, and the increasing demand for human–machine interaction in intelligent products, visual localization technology can help machines and humans recognize and locate objects, thereby promoting human–machine interaction and intelligent manufacturing. At the same time, human–machine interaction continues to evolve, becoming more intelligent, humanized, and efficient. This article proposes a new visual localization model and designs a language validation module that uses language as the primary source of information to increase the model’s interactivity. We also list future possibilities for visual localization and provide two examples that explore how visual localization and human–machine interaction technology can be applied and optimized in practical scenarios, offering reference and guidance for researchers and promoting the development and application of these technologies.

https://eprints.bournemouth.ac.uk/38909/

Source: Scopus

Language-Led Visual Grounding and Future Possibilities

Authors: Sui, Z., Zhou, M., Feng, Z., Stefanidis, A. and Jiang, N.

Journal: Electronics

Volume: 12

Issue: 14

eISSN: 2079-9292

DOI: 10.3390/electronics12143142

https://eprints.bournemouth.ac.uk/38909/

Source: Web of Science (Lite)

Language-Led Visual Grounding and Future Possibilities

Authors: Sui, Z., Zhou, M., Feng, Z., Stefanidis, A. and Jiang, N.

Journal: Electronics

Volume: 12

Issue: 14

eISSN: 2079-9292

https://eprints.bournemouth.ac.uk/38909/

Source: BURO EPrints