Fact-checked by Grok 2 weeks ago
References
-
[1]
Image-to-Image Translation with Conditional Adversarial NetworksWe investigate conditional adversarial networks as a general-purpose solution to image-to-image translation problems.
-
[2]
[2101.08629] Image-to-Image Translation: Methods and ApplicationsJan 21, 2021 · Image-to-image translation (I2I) aims to transfer images from a source domain to a target domain while preserving the content representations.
-
[3]
Image analogies | Proceedings of the 28th annual conference on ...This paper describes a new framework for processing images by example, called “image analogies.” The framework involves two stages.
- [4]
-
[5]
All-in-one medical image-to-image translation - ScienceDirect.comAug 18, 2025 · The growing availability of public multi-domain medical image datasets enables training omnipotent image-to-image (I2I) translation models.
- [6]
- [7]
-
[8]
Speak Easy? The Ups and Downs of Travel Translation AppsJul 17, 2024 · Translation apps make travel smoother by breaking down language barriers, boosting cultural exchange, and helping with directions and emergencies.
-
[9]
How AI-Powered Translation Is Transforming the Travel IndustryThe measurable impact of travel translation AI is profound, reshaping the landscape of the tourism industry by enhancing communication and accessibility.
-
[10]
AI Image Translator: Breaking Language Barriers Through Visual ...Oct 23, 2025 · Applications of AI Image Translators in Real Life Tourists can snap photos of foreign signs, menus, or maps and instantly get translations on ...Why Ai Image Translators Are... · How Ai Image Translation... · Applications Of Ai Image...<|separator|>
-
[11]
The Science Behind Image Translation Technology - ImageTranslateNov 7, 2024 · OCR is the engine that enables image translation by extracting text from an image. This is a critical first step, as it involves identifying and ...Missing: scope | Show results with:scope
-
[12]
Top 6 Industries Benefitting From AI Translation TechnologyJul 8, 2025 · From booking websites and travel itineraries to restaurant menus and tourism guides, AI helps localize content to enhance the user experience.
-
[13]
The History of Google Translate (2004-Today): A Detailed AnalysisJul 9, 2024 · The service launched into proper beta on April 28, 2006. One innovation it came with was statistical machine translation. It had been developed ...The Origin of Google Translate... · The Impact of Google...
-
[14]
Microsoft Translator Adds Image Translation to AndroidApr 20, 2016 · Image translation was added to the Microsoft Translator app for iOS in February, and has been available for the Translator apps for Windows and ...
-
[15]
Microsoft's Android Translator app now works on images tooApr 21, 2016 · Image translation has been available in Microsoft's iOS app since February, and on the company's Windows Phone app since 2010. “With the new ...
-
[16]
How Tourism Cultural Events Influence Multicultural Competence ...Sep 10, 2024 · This study explores the impact of tourism cultural events on fostering multicultural competence and their effects on tourism destinations.
-
[17]
What is OCR (Optical Character Recognition)? - Amazon AWSThe two main types of OCR algorithms or software processes that OCR software uses for text recognition are called pattern matching and feature extraction.
-
[18]
What Is Optical Character Recognition (OCR)? - IBMOptical character recognition (OCR) is a technology that uses automated data extraction to quickly convert images of text into a machine-readable format.
-
[19]
How do most OCR algorithms work? - MilvusMost OCR systems involve three core stages: preprocessing, text detection and segmentation, and character recognition with post-processing.
-
[20]
Optical Character Recognition (OCR) - CorpnceFeb 3, 2024 · Techniques such as noise reduction ... A standout feature in modern OCR implementations is the integration of Convolutional Neural Networks (CNNs) ...
-
[21]
Tesseract Open Source OCR Engine (main repository) - GitHubTesseract was originally developed at Hewlett-Packard Laboratories Bristol UK and at Hewlett-Packard Co, Greeley Colorado USA between 1985 and 1994, with some ...Ocr · Wiki · Releases · Tessdata
-
[22]
Detect text in images | Cloud Vision APIOptical Character Recognition (OCR). The Vision API can detect and extract text from images. There are two annotation features that support optical character ...Detect text in files (PDF/TIFF) · Detect handwriting in images · Document AI overview
-
[23]
Evaluate OCR Output Quality with Character Error Rate (CER) and ...Jun 24, 2021 · In this article, we will look at two metrics used to evaluate OCR output, namely Character Error Rate (CER) and Word Error Rate (WER).Missing: typical | Show results with:typical
-
[24]
[PDF] AI Possible Risks & Mitigations - Optical Character RecognitionFor printed documents with clear and legible text, accurate OCR results in the range of 95% to 99% are commonly achievable.
-
[25]
Optical Character Recognition: Important Feature In the Tech WorldJan 8, 2025 · OCR systems can automatically extract specific information, such as names, dates, or amounts, from structured documents like invoices or forms, ...
-
[26]
Analysis of Image Preprocessing and Binarization Methods for OCR ...May 29, 2023 · This method utilizes a convolutional neural network (CNN) to learn the mapping from the original image to the binarized image. Since the method ...
-
[27]
Image Preprocessing for Improving OCR Accuracy - Semantic ScholarThis paper deals with the preprocessing step before text recognition, specifically with images from a digital camera, and confirms importance of image ...
-
[28]
Document Image Skew Detection: Survey and Annotated BibliographyAlgorithms that estimate the angle at which a document image is rotated (called a document's skew) are surveyed and the contributions of individual ...<|separator|>
-
[29]
A Computational Approach to Edge Detection - IEEE XploreThis paper describes a computational approach to edge detection. The success of the approach depends on the definition of a comprehensive set of goals.
-
[30]
[1704.03155] EAST: An Efficient and Accurate Scene Text DetectorIn this work, we propose a simple yet powerful pipeline that yields fast and accurate text detection in natural scenes.
-
[31]
(PDF) Techniques and challenges of automatic text extraction in ...Aug 10, 2025 · The extraction of text from a complex or more colorful images is a challenging problem. Text data present in images contains useful ...
-
[32]
Scene text detection and recognition: recent advances and future ...Jun 22, 2015 · Text detection and recognition in natural scenes have become important and active research topics in computer vision and document analysis.
-
[33]
Neural Machine Translation by Jointly Learning to Align and ... - arXivSep 1, 2014 · The neural machine translation aims at building a single neural network that can be jointly tuned to maximize the translation performance.
-
[34]
[1706.03762] Attention Is All You Need - arXivJun 12, 2017 · View a PDF of the paper titled Attention Is All You Need, by Ashish Vaswani and 7 other authors. View PDF HTML (experimental). Abstract:The ...
-
[35]
Overview and challenges of machine translation for contextually ...Oct 18, 2024 · The preservation of cultural and contextual aspects is vital in machine translation. Translating idiomatic expressions, metaphors, humor, and ...
-
[36]
Pitfalls of Machine Translation: How to Handle Proper NounsGeneral automatic translation systems have weaknesses in translating proper nouns such as personal names, place names, and product names.Missing: preserving idioms
-
[37]
DeepL Translate and Write Pro APIThe DeepL API offers best-in-class AI translation, custom translations, document translation, and handles HTML/XML, making content multilingual.The Developer-Friendly... · Find The Right Plan For You · What Can You Do With The...
-
[38]
Stabilizing Live Speech Translation in Google TranslateJan 26, 2021 · This masking process thus trades latency for stability, without affecting quality. This is very similar to delay-based strategies used in ...
-
[39]
The Ultimate Guide to Real-Time Language Translation - Fora SoftJul 23, 2025 · You can expect real-time translation services to have a latency of a few seconds. It's fast enough for fluid conversations but may lag slightly ...
-
[40]
A Picture is Worth a Thousand (Correct) Captions: A Vision-Guided Judge-Corrector System for Multimodal Machine Translation### Summary of Error Correction in Multimodal Machine Translation
-
[41]
AnyTrans: Translate AnyText in the Image with Large Scale Models### Summary of Post-Processing and Rendering in AnyTrans (arXiv:2406.11432)
-
[42]
Show Me the World in My Language: Establishing the First Baseline for Scene-Text to Scene-Text Translation### Summary of Rendering Techniques and Evaluation Metrics from arXiv:2308.03024
-
[43]
Character Keypoint-based Homography Estimation in Scanned Documents for Efficient Information Extraction### Summary of Homography Use for Text Alignment in Document Images
-
[44]
Exploring In-Image Machine Translation with Real-World Background### Summary of Sections from https://arxiv.org/abs/2505.15282
-
[45]
History and the future: Deep-learning-based OCR - advance.aiJan 19, 2021 · Prior to AlexNet's win at the ImageNet, traditional Computer Vision (CV) technology dominated OCR research. At that time, a standard processing ...
-
[46]
Google Translate's 'Word Lens' Feature Now Supports 27 LanguagesJul 30, 2015 · “Word Lens” is activated by opening the Google Translate app, tapping on the camera icon and holding your device in front of the text. The text ...
-
[47]
Recognizing Text in Images | Apple Developer DocumentationVision provides its text-recognition capabilities through VNRecognizeTextRequest, an image-based request type that finds and extracts text in images.
-
[48]
Google's Neural Machine Translation System: Bridging the Gap ...In this work, we present GNMT, Google's Neural Machine Translation system, which attempts to address many of these issues.Missing: adoption | Show results with:adoption
-
[49]
An End-to-End Trainable Neural Network for Image-based ... - arXivJul 21, 2015 · This paper proposes an end-to-end trainable neural network for scene text recognition, integrating feature extraction, sequence modeling, and ...
-
[50]
Google Translate App Gets an Upgrade - The New York TimesJan 14, 2015 · Google has been doing some form of translation since 2001. The Google Translate app now has 90 languages and some 500 million monthly users.
-
[51]
Vision Transformer for Fast and Efficient Scene Text RecognitionMay 18, 2021 · ViTSTR is a scene text recognition model using a vision transformer, achieving 82.6% accuracy at 2.4x speed, with 43.4% fewer parameters.Missing: translation | Show results with:translation
-
[52]
Vision Transformer for Fast and Efficient Scene Text RecognitionIn this paper we propose ViTSTR, an STR with a simple single stage model architecture built on a compute and parameter efficient vision transformer (ViT).
-
[53]
Snapchat Translation Hacks: Making Communication EasierFeb 19, 2023 · Snapchat Translation uses advanced machine learning algorithms to provide real-time translation of text in images, videos, or chat messages.Missing: AR | Show results with:AR
-
[54]
Apple Vision Pro 'Visual Search' Feature Can Identify Items, Copy ...Jun 21, 2023 · With Visual Search, users can use the Vision Pro headset to get information about an item, detect and interact with text in the world around them, copy and ...
-
[55]
Introducing Apple Vision Pro: Benefits and guide for developersJul 25, 2023 · With its built-in cameras, sensors and powerful processors, the Apple Vision Pro can track your movements, recognize objects and even translate ...
-
[56]
GPT-4 - OpenAIMar 14, 2023 · GPT‑4 can accept a prompt of text and images, which—parallel to the text-only setting—lets the user specify any vision or language task.
-
[57]
GPT-4 Turbo and Vision in Localization - Costom.MTExplore OpenAI's GPT-4 update and its transformative role in localization. Affordable translation, speech recognition, and automated testing.
-
[58]
How to use Azure OpenAI GPT-4 Turbo with Vision to describe imagesJan 18, 2024 · First, you need to go to AI Studio to create a deployment of GPT-4 model that is set to version vision-preview. It is also possible to change ...Missing: translation | Show results with:translation
-
[59]
Overview - Out of Vocabulary Scene Text Understanding12 July 2022: Important: Test set was updated to include more diverse data. Please download the new test set. 20 July 2022: Submission of results deadline.
-
[60]
Towards Boosting the Accuracy of Non-Latin Scene Text RecognitionJan 10, 2022 · Over the last decade, generating synthetic datasets with powerful deep learning techniques has tremendously improved scene-text recognition.
-
[61]
A survey on methods, datasets and implementations for scene text ...Jul 6, 2022 · The latest revision of the ICDAR MLT dataset in 2019 included more images and a total of 10 different languages, and an additional synthetic ...
- [62]
-
[63]
Amazon.com : AI Language Translator Device, 2025 Upgraded ...Support photo translation in up to 74 languages, making it easier for you to read menus/signposts/magazines/labels in different languages. Equipped with a flash ...Missing: computing | Show results with:computing
- [64]
- [65]
-
[66]
[PDF] OCR Error Correction Using Statistical Machine TranslationThese tables show that OCR error correction using a word level SMT system can provide a decrease in terms of WER (column Err) from 4.9% to 1.9%, which ...
-
[67]
Automatic Vehicle License Plate Detection from Security Cameras using Deep Learning TechniquesInsufficient relevant content. The provided content snippet from https://ieeexplore.ieee.org/document/10784452 does not contain specific information on environmental factors affecting OCR accuracy, such as lighting, angles, or occlusions. It only includes a title ("Automatic Vehicle License Plate Detection from Security Cameras using Deep Learning Techniques") and metadata, with no detailed text or data available in the excerpt.
-
[68]
[PDF] OCR Improves Machine Translation for Low-Resource LanguagesMay 22, 2022 · The OCR SOTA model accuracy is the highest for European scripts such as Latin and Cyrillic. The OCR accuracy on Latin and Cyrillic is good (< 2 ...<|control11|><|separator|>
-
[69]
Multilingual OCR: Supported Languages and CapabilitiesMultilingual OCR introduces unique challenges such as: Similar-looking characters across scripts (e.g., Latin “a” vs Cyrillic “а”); Bidirectional text (Arabic ...
-
[70]
Technical Analysis of Modern Non-LLM OCR Engines | IntuitionLabsSupported Languages: PaddleOCR supports 80+ languages out of the box. This includes a wide range of scripts: Latin, Chinese (simplified & traditional), Japanese ...Open-Source Ocr Systems And... · Easyocr -- Simple Api, Crnn... · Mmocr -- Openmmlab's Modular...
-
[71]
The State of Multilingual AI - ruder.ioNov 14, 2022 · Current multilingual AI models mostly focus on English and few languages with large resources. Limited data is a major challenge, with most ...
-
[72]
JaidedAI/EasyOCR - GitHubReady-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.Missing: accuracy | Show results with:accuracy
-
[73]
The Problem of Building OCR Models for Handwritten Devanagari ...Oct 9, 2024 · Addressing the challenges of OCR for handwritten Devanagari text will unlock the potential to digitize vast repositories of handwritten data.Fruitpunch Ai · We Teach Applied Ai Through... · Ocr Challenges<|control11|><|separator|>
-
[74]
Is Google Lens Safe? Your Quick Guide for 2025Loss of privacy is the biggest risk associated with Google Lens. Google stores and processes all user data, including any personal photographs that you might ...
-
[75]
The privacy risks of online file-sharing and translation tools - SchillingsMay 12, 2023 · Explore the hidden privacy, reputation and security risks that free online tools and platforms pose to family offices, small businesses and prominent ...
-
[76]
Are You Accidentally Using Google Translate for Official Documents?Sep 14, 2025 · The main risks of using free translation tools for professional work are severe inaccuracies, breaches of confidentiality, and a lack of ...
-
[77]
Three Things to Help Improve Low-Resource Language AI ... - SlatorMay 1, 2025 · New Stanford University white paper explores causes of, and possible solutions to, issues limiting AI translation for low-resource ...
-
[78]
[PDF] Evaluating Gender Bias in Multilingual Multimodal AI ModelsAug 16, 2024 · 5.1 Why is bias exacerbated in low-resource languages? From the results of experiments conducted in a. “text-to-image retrieval” setting, we ...
-
[79]
Scaling neural machine translation to 200 languages - NatureJun 5, 2024 · The current techniques used for training translation models are difficult to extend to low-resource settings, in which aligned bilingual textual ...
-
[80]
Generative AI Has an Intellectual Property ProblemApr 7, 2023 · There are infringement and rights of use issues, uncertainty about ownership of AI-generated works, and questions about unlicensed content in ...Missing: translation | Show results with:translation
-
[81]
ChatGPT 5 Copyright: The Intellectual Property Challenges for ...Apr 29, 2024 · Translators must verify the sources of data used by AI translation tools to avoid including copyrighted materials and facing infringement claims ...
-
[82]
[PDF] The impact of the General Data Protection Regulation (GDPR) on ...It discusses the tensions and proximities between AI and data protection principles, such as, in particular, purpose limitation and data minimisation.<|separator|>
-
[83]
AI: ensuring GDPR compliance - CNILSep 21, 2022 · In order to comply with the GDPR, an artificial intelligence (AI) system based on the use of personal data must always be developed, trained, ...
-
[84]
AI Regulation Debate Highlights Lack of Data Privacy ProtectionSep 26, 2023 · Lawmakers in both parties acknowledge that they must first resolve a less trendy but more fundamental problem: data privacy and protection.
-
[85]
Lost in AI translation: growing reliance on language apps ...Sep 7, 2023 · Translators say the US immigration system relies on AI-powered translations, without grasping the limits of the tools.
-
[86]
How Dangerous Is Criminal Use of AI Translation for Global Security?Sep 19, 2025 · AI translation is a powerful tool, but in the wrong hands, it makes global crime faster and harder to detect. From phishing scams to extremist ...
-
[87]
Non-Western cultures misrepresented, harmed by generative AI ...Oct 7, 2024 · Penn State and University of Washington researchers found that AI models may cause cultural harms that go beyond surface-level biases.
-
[88]
Flamingo: a Visual Language Model for Few-Shot Learning - arXivApr 29, 2022 · Thanks to their flexibility, Flamingo models can be trained on large-scale multimodal web corpora containing arbitrarily interleaved text ...
-
[89]
Updates to Apple's On-Device and Server Foundation Language ...Jun 9, 2025 · The models have improved tool-use and reasoning capabilities, understand image and text inputs, are faster and more efficient, and are designed to support 15 ...Model Architectures · Training Data · Responsible AiMissing: federated | Show results with:federated
-
[90]
Private Federated Learning In Real World Application – A Case StudyThis paper presents an implementation of machine learning model training using private federated learning (PFL) on edge devices.Missing: image translation
-
[91]
12 Augmented Reality Technology Trends to Watch in 2025 - MobiDevSep 8, 2025 · The Google Translate app lets you point your phone at text in any language and it can display a translated text overlay in real time. Case ...
-
[92]
Multimodal Reinforcement-Learning MT: How Visual Cues Are ...Jun 27, 2025 · On-device translation engines for AR/VR headsets; Multilingual avatar systems that translate in real time based on speech, gestures, and facial ...Key Advantages Of Rl In... · Visual Cues In Live... · 2. Augmented Reality (ar)<|separator|>
-
[93]
PrecisionGAN: enhanced image-to-image translation for preserving ...It is fine-tuned with a hybrid loss function optimized to enhance accuracy and reduce artifacts, even when the training data is imperfect. Our evaluations show ...
-
[94]
[PDF] A Framework Using Generative Adversarial Networks and ...A robust foundation for unsupervised image translation is established by the Cycle GAN model's dual translation methods and cycle consistency. Via ensuring that ...
-
[95]
(PDF) Transfer Learning & GANs for OCR from Engineering DocsJun 11, 2022 · This paper explores deep learning models and OCR methods to effectively extract textual information from engineering documents collected by the ...
-
[96]
[PDF] ERNIE 4.5 Technical ReportJun 29, 2025 · 5-VL (Bai et al., 2025) have extended these abilities to visual data, enabling robust visual reasoning and interpretation. These models have not ...
-
[97]
Qianfan-VL: Domain-Enhanced Universal Vision-Language ModelsSep 19, 2025 · All models are trained entirely on Baidu's Kunlun P800 chips, validating the capability of large-scale AI infrastructure to train SOTA-level ...
-
[98]
All-in-one medical image-to-image translation - PMCAug 11, 2025 · The growing availability of public multi-domain medical image datasets enables training omnipotent image-to-image (I2I) translation models.Missing: emerging | Show results with:emerging
-
[99]
AraTraditions10k bridging cultures with a comprehensive dataset for ...Jun 4, 2025 · AraTraditions10k, a comprehensive and culturally rich dataset, has been introduced to enhance cross-lingual image annotation, retrieval, and tagging.
-
[100]
[PDF] Parallel Corpora for Machine Translation in Low-Resource Indic ...May 3, 2025 · This review provides a comprehensive overview of avail- able parallel corpora for Indic languages, which span diverse linguistic families, ...<|separator|>
-
[101]
Optimal Training Dataset Preparation for AI-Supported ... - MDPIDec 8, 2023 · Adding different fonts, writing styles, and document layouts to fake datasets will help an optical character recognition (OCR) system better ...
-
[102]
[PDF] Translation-Enhanced Multilingual Text-to-Image GenerationJul 9, 2023 · Regarding RQ2, we aim to combine MT-based and zero-shot cross-lingual transfer via fast and parameter-efficient fine-tuning. Inspired by the.
-
[103]
Enhancing Zero-Shot Translation in Multilingual Neural Machine ...Sep 17, 2024 · This simple change significantly improves the quality of zero-shot translations, with an increase of up to 11.1 BLEU points, a measure of ...<|separator|>
-
[104]
OCR post-correction for detecting adversarial text imagesIn this paper, we propose an OCR post-correction algorithm to improve the robustness of OCR-based systems against images with perturbed embedded texts.
-
[105]
Robustness Evaluation of OCR-based Visual Document ... - arXivJun 19, 2025 · We introduce the first unified framework for generating and evaluating multi-modal adversarial attacks on OCR-based VDU models.
-
[106]
A free, user-friendly graphical interface for image translation using ...DeepImageTranslator is designed to be a user-friendly graphical interface tool that allows researchers with no programming experience to easily build, train, ...
-
[107]
Smartcat Image Translation AgentSmartcat's Image Agent achieves over 90% translation accuracy on average in real-world use. It continuously improves by learning from previous human corrections ...
-
[108]
[PDF] Diffusion Model Compression for Image-to-Image TranslationIn this work, we introduce a novel approach to reduce both memory footprint and latency of diffusion models for downstream Image-to-Image (I2I) applica- tions.
-
[109]
Single-Stream Image-to-Image Translation (SSIT): A More Efficient ...Dec 16, 2024 · Now, researchers from Sophia University have developed a model which can reduce the computational requirements needed to run these models, ...Missing: demands | Show results with:demands
-
[110]
Wearable Devices for Real-Time Translation and InterpretationOct 11, 2023 · Wearable translation devices now offer features like noise cancellation, speech recognition, and context understanding, making communication smoother and more ...
-
[111]
Wearable Translator Technology: How to Choose the Best SolutionAug 12, 2025 · A wearable translator is a portable, hands-free device that provides real-time language translation, enabling seamless communication between ...