PDF Scrape > OCR > JSON -- 2

Closed, posted 6 years ago, paid on delivery

Our project involves the following top-level processes:

1. Scrape a site that hosts PDF files. This requires careful scraping with request masking, for example through proxies or randomized request patterns, so that we do not get blocked.

2. A. Extract the text from each PDF. B. Use OCR to extract the text and digits that are deliberately embedded as images to mask them in the PDF. The PDFs contain both text and images (see attached).

3. Convert the results to our JSON file format and send them to the endpoint on a schedule.
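
To make the three steps above concrete, here is a minimal Python sketch using only the standard library. Everything in it is an assumption for illustration: the function names, the JSON field names (`source_url`, `text`, `ocr_text`, `records`), and the delay range are hypothetical, since the posting does not define the JSON format or the endpoint. The text extractor and OCR routine are passed in as callables because the posting does not name a PDF or OCR library.

```python
import json
import random
import urllib.request


def pick_request_settings(proxies, user_agents, rng=random):
    """Step 1: choose a random proxy, user agent, and delay for the next download.

    The 2-8 second delay range is an arbitrary placeholder, not a requirement
    from the posting.
    """
    return {
        "proxy": rng.choice(proxies) if proxies else None,
        "user_agent": rng.choice(user_agents),
        "delay_seconds": rng.uniform(2.0, 8.0),
    }


def build_record(source_url, pdf_bytes, extract_text, ocr_images):
    """Steps 2A and 2B: combine the embedded text with the OCR output.

    extract_text and ocr_images are caller-supplied functions that take the
    raw PDF bytes; which libraries sit behind them is left open.
    """
    return {
        "source_url": source_url,
        "text": extract_text(pdf_bytes),
        "ocr_text": ocr_images(pdf_bytes),
    }


def build_post_request(endpoint, records):
    """Step 3: wrap the records in a JSON POST request (built, not sent)."""
    body = json.dumps({"records": records}).encode("utf-8")
    return urllib.request.Request(
        endpoint,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )
```

A scheduled job (cron, for instance) could call `build_post_request` and pass the result to `urllib.request.urlopen` to deliver each batch. Keeping the request construction separate from the sending makes the payload easy to inspect before anything goes over the network.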

This project requires an immediate start.

GMT+9

JSON OCR PDF Web data extraction XML

Project ID: #15313928

About the project

14 proposals, remote project, active 6 years ago