Augmented Reality Application for Optical Character Recognition

Nandan Kumar, N. and Wan Nor Al-Ashekin, Wan Husin (2024) Augmented Reality Application for Optical Character Recognition. INTI JOURNAL, 2024 (18). pp. 1-9. ISSN e2600-7320

[img] Text
ij2024_18.pdf - Published Version
Available under License Creative Commons Attribution.

Download (373kB)
Official URL: https://intijournal.intimal.edu.my

Abstract

Augmented Reality (AR) technology has become popular for improving user experiences by superimposing virtual features on the real world. OCR is another recent method for extracting text from photos or real-world items. AR and OCR are combined in a new software that provides an immersive and engaging experience. The proposed AR-based OCR system uses Firebase as a backend. Users can point their smartphones at papers, signs, or other textual material to use AR, which will automatically recognize and extract the content. This extracted content can be translated, converted to text-to-speech, or shared on social media. Storage and management of recognized text data is reliable and scalable with the Firebase database connector. The Firebase Realtime Database can immediately sync extracted text across several devices for user collaboration and sharing. Firebase Authentication can authenticate and authorize users for safe OCR access. The program uses image processing for text extraction, OCR models for accurate recognition, and AR frameworks like ARCore (Android) and ARKit (iOS). The application will be linked to the Firebase backend using SDKs and APIs for real-time data synchronization and safe data storage. The AR-based OCR application has great promise in education, logistics, retail, and other industries. It can extract text from physical documents, increase accessibility for visually challenged people, and translate foreign language text in real time. Firebase's backend database solution meets the application's needs for scalability, dependability, and data security.

Item Type: Article
Uncontrolled Keywords: Augmented Reality, Optical Character Recognition, OCR models, Image Processing, Text Extraction
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Q Science > QA Mathematics > QA76 Computer software
T Technology > T Technology (General)
Depositing User: Unnamed user with email masilah.mansor@newinti.edu.my
Date Deposited: 24 Jul 2024 09:20
Last Modified: 24 Jul 2024 09:20
URI: http://eprints.intimal.edu.my/id/eprint/1950

Actions (login required)

View Item View Item