Extract articles from PDF page -- 3

In progress, posted a year ago, paid on delivery

I need to extract articles from any PDF file like the sample attached.

You can find a sample of how the texts and regions are extracted here:

[login to view URL]

Here's a tool that promised to do the same but it's offline:

[login to view URL]

You are expected to develop an article extractor that generates a JSON or XML file from any newspaper or magazine PDF file. The image "[login to view URL]" shows how the articles should be extracted.

Accepted technologies: Java, Linux, Kotlin. The solution must be open source and must not depend on cloud or any other paid services.
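To make the expected output more concrete, here is a minimal Java sketch of the grouping and JSON step. It is only an illustration under stated assumptions: the posting does not define a schema, so the `Block` and `Article` types, the `title` and `paragraphs` field names, and the "larger font means headline" heuristic are all hypothetical. It also assumes the positioned text blocks have already been read from the PDF by some open-source library; that part is not shown.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

/** Sketch only: groups positioned text blocks into articles and emits JSON. */
class ArticleGrouper {

    /** Hypothetical text block: bounding box (origin at top-left), font size, text. */
    record Block(double x, double y, double width, double height,
                 double fontSize, String text) {
        boolean overlapsHorizontally(Block other) {
            return x < other.x + other.width && other.x < x + width;
        }
    }

    /** Hypothetical article: one headline block plus its body blocks. */
    record Article(Block headline, List<Block> body) {}

    /**
     * Assumed heuristic: a block whose font size is at least headlineFontSize
     * starts an article; every other block joins the closest headline above it
     * that overlaps it horizontally. Blocks with no such headline are dropped.
     */
    static List<Article> group(List<Block> blocks, double headlineFontSize) {
        List<Block> sorted = new ArrayList<>(blocks);
        sorted.sort(Comparator.comparingDouble(Block::y).thenComparingDouble(Block::x));

        List<Article> articles = new ArrayList<>();
        for (Block b : sorted) {
            if (b.fontSize() >= headlineFontSize) {
                articles.add(new Article(b, new ArrayList<>()));
            }
        }
        for (Block b : sorted) {
            if (b.fontSize() >= headlineFontSize) continue;
            Article owner = null;
            for (Article a : articles) {
                Block h = a.headline();
                boolean candidate = h.y() <= b.y() && h.overlapsHorizontally(b);
                if (candidate && (owner == null || h.y() > owner.headline().y())) {
                    owner = a;
                }
            }
            if (owner != null) owner.body().add(b);
        }
        // Read the body column by column: left to right, then top to bottom.
        for (Article a : articles) {
            a.body().sort(Comparator.comparingDouble(Block::x).thenComparingDouble(Block::y));
        }
        return articles;
    }

    /** Escapes a string for use inside a JSON string literal. */
    static String escape(String s) {
        StringBuilder sb = new StringBuilder();
        for (char c : s.toCharArray()) {
            switch (c) {
                case '"' -> sb.append("\\\"");
                case '\\' -> sb.append("\\\\");
                case '\n' -> sb.append("\\n");
                case '\r' -> sb.append("\\r");
                case '\t' -> sb.append("\\t");
                default -> {
                    if (c < 0x20) sb.append(String.format("\\u%04x", (int) c));
                    else sb.append(c);
                }
            }
        }
        return sb.toString();
    }

    /** Serializes the articles as {"articles":[{"title":...,"paragraphs":[...]}]}. */
    static String toJson(List<Article> articles) {
        StringBuilder sb = new StringBuilder("{\"articles\":[");
        for (int i = 0; i < articles.size(); i++) {
            Article a = articles.get(i);
            if (i > 0) sb.append(',');
            sb.append("{\"title\":\"").append(escape(a.headline().text()))
              .append("\",\"paragraphs\":[");
            for (int j = 0; j < a.body().size(); j++) {
                if (j > 0) sb.append(',');
                sb.append('"').append(escape(a.body().get(j).text())).append('"');
            }
            sb.append("]}");
        }
        return sb.append("]}").toString();
    }

    public static void main(String[] args) {
        List<Block> page = List.of(
            new Block(0, 0, 200, 30, 24, "City Council Approves Budget"),
            new Block(220, 0, 200, 30, 24, "Local Team Wins Final"),
            new Block(0, 40, 95, 300, 10, "First column of the budget story."),
            new Block(105, 40, 95, 300, 10, "Second column of the budget story."),
            new Block(220, 40, 200, 300, 10, "Match report text."));
        System.out.println(toJson(group(page, 18)));
    }
}
```

A font-size threshold is only one possible signal. Real newspaper layouts with nested headlines, pull quotes, or articles continued on another page would likely need additional cues such as separator lines and whitespace gaps.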

Step 1 - Development - You generate a JSON/XML file from a PDF that follows these rules, and you win the project.

Step 2 - Tests - You send us the executable JAR file so we can test it with other PDF files.

Step 3 - Payment - If it works, we release 50% of the payment and you send the source code. If everything is OK with the source code, the other half is released.

Python, PDF, Artificial Intelligence, Image Processing, OpenCV

Project ID: #34290794

About the project

5 proposals, remote project, active a year ago

5 freelancers are bidding an average of $19 for this job

jahoyz

Hi, I've read your description carefully. I have extensive experience with Python and PDF2XML, and I've also worked on several similar projects, so I can complete your project with high quality and on time. Looking forward to hear more...

$20 USD in 5 days
(10 reviews)
3.9
stevst

Hi. How are you? As a highly skilled developer, I can help you perfectly. I am very confident in my skills and I'd like to help your business by doing my best. I always believe in building long business relationships with clie...

$20 USD in 7 days
(2 reviews)
2.3
Digitalexpertuae

Hey there, I am a professional writer with experience in Python, Image Processing, PDF, OpenCV and Artificial Intelligence. Do you need an article or any piece of content written or rewritten and you do not want to...

$10 USD in 2 days
(0 reviews)
0.0
mahikakker16

Hi, I am Himanshi from India. My skills: data entry, Word operator, copy typing, article writing, health education tips, etc. Digital marketing is fun; it's not hard because I work very easily and clearly in this pla...

$20 USD in 4 days
(0 reviews)
0.0