🪴 Aradinka Digital Garden

Search

Search IconIcon to open search

optical-character-recognition

Last updated Jun 22, 2023

# Computer Vision Research Papers Popularity


# Anatomi OCR


# Mendapatkan Model

Using pre-trained model only

or

Using pre-trained model, then finetune with our data


# Model Backbone


# Previous Work

Done by mas Anshar back in September 2022


# Current Progress


# Current Progress


# EasyOCR Process Breakdown


# OCR Option


# EasyOCR

(+)

(-)


# Deployment EasyOCR Option 1: Using ONNX Runtime

ONNX can be run on mobile devices. Model size: 14mb

Export the model from python

1
torch.onnx.export()

ONNX package on andorid

1
onnxruntime-android package

# Deployment EasyOCR Option 1: Using ONNX Runtime

GitHub Issues:


# Deployment EasyOCR Option 1: Using ONNX Runtime

Todo:


# Option 2: Using PyTorch runtime

Convert the model to TorchScript

GitHub issue:


# Google ML Kit Vision

Guide run in python

(-) Cloud Pricing list


# PaddleOCR

(+)

(-)


# Other Library


# Deploy on the Cloud


# Downstream task


# Usefull repo


# Question


# Notes

Kriteria library:


# Opsi deployment Online


# Opsi Deployment Offline

  1. EasyOCR in ONNX format
    • Perlu reduce model size, process model output
    • create key information extraction native di dalem android
  2. ML Kit Vision
    • Perlu tambahan bikin code key information extraction native di dalem android

# Todo


# Notes

Output boxes: kiri atas, kanan atas kanan bawah, kiri bawah

(lebar, tinggi)

Bounding box

1
2
3
4
{PROVINSi JAWA BARAT=[Point(377, 71), Point(760, 83), Point(758, 119), Point(375, 107)]}
{KOTA BOGOR=[Point(456, 114), Point(677, 116), Point(676, 143), Point(455, 141)]}
{NIK=[Point(55, 150), Point(134, 152), Point(133, 179), Point(54, 177)]}
{32710L0907000013=[Point(253, 156), Point(732, 168), Point(731, 199), Point(252, 187)]}

Center (horizontal, vertikal)

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
{PROVINSi JAWA BARAT=567,95}
{KOTA BOGOR=566,128}
{NIK=94,164}
{32710L0907000013=492,177}
{Nama=91,224}
{AZKA MUHAMMAD RADINKA=496,233}
{PURBAE=360,258}
{TempatgtlahirDEPOK, 09-07-2000=311,290}
{Jenis kelamin=137,312}
{Alamat=97,341}
{RT/RW=160,371}
{Kel/Desa=170,402}
{Kecamatan KOTA BOGORUTARA=355,436}
{Agama=91,462}
{Status Perkawinan: BELUM KAWIN=274,494}
{Pekerjaan=110,521}
{Kewarganegaraan: WNI=200,553}
{Berlaku Hingga=141,580}
{LAKI-LAK=358,320}
{PERUMAHAN TAMAN SERUN=505,349}
{008/004E=359,377}
{TANAHBARU=402,408}
{Gol. Darah O=633,324}
{ISLAM=341,463}
{PELAJAR/MAHASISWA=456,526}
{KOTABOGOR=899,510}
{05-07-201B=900,534}
{SEUMUR HIDUP=402,585}

# OCR Progress