Version: Next

useVerticalOCR

Experimental

The useVerticalOCR hook is currently in an experimental phase. We appreciate feedback from users as we continue to refine and enhance its functionality.

Optical Character Recognition (OCR) is a computer vision technique used to detect and recognize text within images. It is commonly utilized to convert a variety of documents, such as scanned paper documents, PDF files, or images captured by a digital camera, into editable and searchable data. Traditionally, OCR technology has been optimized for recognizing horizontal text, and integrating support for vertical text recognition often requires significant additional effort from developers. To simplify this, we introduce useVerticalOCR, a tool designed to abstract the complexities of vertical text OCR, enabling seamless integration into your applications.

warning

It is recommended to use models provided by us, which are available at our Hugging Face repository. You can also use constants shipped with our library.

API Reference

For detailed API Reference for useVerticalOCR see: useVerticalOCR API Reference.
For all alphabets available in ocr out-of-the-box in React Native ExecuTorch see: OCR Supported Alphabets.

High Level Overview

import { useVerticalOCR, OCR_ENGLISH } from 'react-native-executorch';

function App() {
  const model = useVerticalOCR({
    model: OCR_ENGLISH,
    independentCharacters: true,
  });

  // ...
  for (const ocrDetection of await model.forward('https://url-to-image.jpg')) {
    console.log('Bounding box: ', ocrDetection.bbox);
    console.log('Bounding label: ', ocrDetection.text);
    console.log('Bounding score: ', ocrDetection.score);
  }
  // ...
}

Arguments

useVerticalOCR takes VerticalOCRProps that consists of:

model containing detectorSource, recognizerSource, and language.
An optional flag preventLoad which prevents auto-loading of the model.
An optional flag independentCharacters indicating either to treat characters as independent or words.

You need more details? Check the following resources:

For detailed information about useVerticalOCR arguments check this section: useVerticalOCR arguments.
For all alphabets available in ocr out-of-the-box in React Native ExecuTorch see: OCR Supported Alphabets.
For more information on loading resources, take a look at loading models page.

Returns

useVerticalOCR returns an object called OCRType containing bunch of functions to interact with Vertical OCR models. To get more details please read: OCRType API Reference.

Running the model

To run the model, you can use the forward method. It accepts one argument, which is the image. The image can be a remote URL, a local file URI, or a base64-encoded image (whole URI or only raw base64). The function returns an array of OCRDetection objects. Each object contains coordinates of the bounding box, the text recognized within the box, and the confidence score. For more information, please refer to the reference or type definitions.

Detection object

The detection object is specified as follows:

interface Point {
  x: number;
  y: number;
}

interface OCRDetection {
  bbox: Point[];
  text: string;
  score: number;
}

The bbox property contains information about the bounding box of detected text regions. It is represented as four points, which are corners of detected bounding box. The text property contains the text recognized within detected text region. The score represents the confidence score of the recognized text.

Example

import { useVerticalOCR, OCR_ENGLISH } from 'react-native-executorch';

function App() {
  const model = useVerticalOCR({
    model: OCR_ENGLISH,
    independentCharacters: true,
  });

  const runModel = async () => {
    const ocrDetections = await model.forward('https://url-to-image.jpg');

    for (const ocrDetection of ocrDetections) {
      console.log('Bounding box: ', ocrDetection.bbox);
      console.log('Bounding text: ', ocrDetection.text);
      console.log('Bounding score: ', ocrDetection.score);
    }
  };
}

Alphabet-Specific Recognizers

Each supported alphabet requires its own recognizer model. The built-in constants, such as RECOGNIZER_LATIN_CRNN or RECOGNIZER_CYRILLIC_CRNN, point to specific models trained for a particular alphabet.

For example:

To recognize text in languages using the Latin alphabet (like Polish, or German), use:

RECOGNIZER_LATIN_CRNN

To recognize text in languages using the Cyrillic alphabet (like Russian or Ukrainian), use:

RECOGNIZER_CYRILLIC_CRNN

You need to make sure the recognizer model you pass in recognizerSource matches the alphabet of the language you specify.

Supported languages

For all alphabets available in ocr out-of-the-box in React Native ExecuTorch see: OCR Supported Alphabets.

Supported models

Model	Type
CRAFT	Detector
CRNN	Recognizer

API Reference​

High Level Overview​

Arguments​

Returns​

Running the model​

Detection object​

Example​

Alphabet-Specific Recognizers​

Supported languages​

Supported models​