| OCRSpace | ocrspace | app | This component uses the OCRSpace app. |
| Language | language | string | Select a value from the drop down menu:{
"label": "Arabic",
"value": "ara"
}{
"label": "Bulgarian",
"value": "bul"
}{
"label": "Chinese (Simplified)",
"value": "chs"
}{
"label": "Chinese (Traditional)",
"value": "cht"
}{
"label": "Croatian",
"value": "hrv"
}{
"label": "Czech",
"value": "cze"
}{
"label": "Danish",
"value": "dan"
}{
"label": "Dutch",
"value": "dut"
}{
"label": "English",
"value": "eng"
}{
"label": "Finnish",
"value": "fin"
}{
"label": "French",
"value": "fre"
}{
"label": "German",
"value": "ger"
}{
"label": "Greek",
"value": "gre"
}{
"label": "Hungarian",
"value": "hun"
}{
"label": "Korean",
"value": "kor"
}{
"label": "Italian",
"value": "ita"
}{
"label": "Japanese",
"value": "jpn"
}{
"label": "Polish",
"value": "pol"
}{
"label": "Portuguese",
"value": "por"
}{
"label": "Russian",
"value": "rus"
}{
"label": "Slovenian",
"value": "slv"
}{
"label": "Spanish",
"value": "spa"
}{
"label": "Swedish",
"value": "swe"
}{
"label": "Turkish",
"value": "tur"
} |
| Is Overlay Required | isOverlayRequired | boolean | If true, returns the coordinates of the bounding boxes for each word. If false, the OCR'ed text is returned only as a text block (this makes the JSON reponse smaller). Overlay data can be used, for example, to show text over the image
|
| Detect Orientation | detectOrientation | boolean | If set to true, the api autorotates the image correctly and sets the TextOrientation parameter in the JSON response. If the image is not rotated, then TextOrientation=0, otherwise it is the degree of the rotation, e. g. "270".
|
| Scale | scale | boolean | If set to true, the api does some internal upscaling. This can improve the OCR result significantly, especially for low-resolution PDF scans. Note that the front page demo uses scale=true, but the API uses scale=false by default. See also this OCR forum post.
|
| Is Table | isTable | boolean | If set to true, the OCR logic makes sure that the parsed text result is always returned line by line. This switch is recommended for table OCR, receipt OCR, invoice processing and all other type of input documents that have a table like structure.
|
| OCR Engine | ocrEngine | string | Select a value from the drop down menu:{
"label": "OCR Engine 1",
"value": "1"
}{
"label": "OCR Engine 2",
"value": "2"
} |
| PDF File | file | string | The URL of the PDF file or the path to the file saved to the /tmp directory (e.g. /tmp/example.pdf) to process. See the documentation
|
| N/A | syncDir | dir | This component uses dir to share files between component executions. |