Desktop ( or web) app scrape PDF OCR tesseract django poppler

Completado Publicado hace 6 años Pagado a la entrega
Completado Pagado a la entrega

Desktop ( or web) app to scrape PDF available at a url and save the extracted text into one of the column of a mysql/postgre db

Extractor Module (this can be a desktop app or a webAPP)

1) I have a database of url links

2) Fetch pdf from the url links from step 1.

3) Extract text (attached sample file) from the pdf links

4) Store the extracted info into postgresql(preferred) or mysql

provide some basic selection criteria for testing with smaller loaders.

provide lag seconds input so that the PDF source website is not overloaded with our fetch.

-->examples of similar projects that you have worked on? How can i verify this?

-->if you are in different timezone, are you available for 7am US pacific time meeting?

-->What are your hourly rates for tasks beyond the requirements in this job post?

->What technologies you will use for Scraping module?

I need atleast 2 business days for testing after its bug free. I will release 80% of the payment.

20% after additional 5 business days of testing.

Java OCR PHP Python

Nº del proyecto: #15484456

Sobre el proyecto

9 propuestas Proyecto remoto Activo hace 6 años

Adjudicado a:

riyazaec

Hi, With the help of python and tesseract, I have converted the pdf2txt. Please find the converted data from the below URL and ping for further discussion. Relevant Skills and Experience PHP, Python, Django (OCR on PD Más

$250 USD en 3 días
(16 comentarios)
4.8

9 freelancers están ofertando un promedio de $198 por este trabajo

Angel521

I am very interested in your project I have 6 years of web scraping history and I have scrapped more than 1000 website I have already done like this job (bypassing captcha or else other scrap preventing tool is no pr Más

$144 USD en 3 días
(103 comentarios)
7.4
prakash2813

Hi there, I can make you desktop app which will do this job. I have strong knowledge in Tesseract OCR, but can't guarantee 100% accuracy as fonts and size is different in attached pdfs. Message me, Relevant Skills and Más

$260 USD en 5 días
(101 comentarios)
6.2
kazimking

Hello Sir, hope you are fine, I am interested in your work "pdf to text". I test your PDF in newocr.com. Relevant Skills and Experience I have 5 year experience in Web Development Service I am able to this job quick Más

$250 USD en 3 días
(39 comentarios)
5.2
anuragiitk

I am an IITK graduate and I have 9 years of experience in software development. I have 100% completion rate and I have finished all the projects with the highest level of customer satisfaction. Relevant Skills and Exp Más

$155 USD en 3 días
(24 comentarios)
5.6
amolupraity

I have experience in OCR, python and PDF manipulation. The task will be completed by the proposed deadline. Relevant Skills and Experience I have more than 2 years of experience in OCR and Python. Proposed Milestones Más

$120 USD en 3 días
(17 comentarios)
3.6
qwerk

Hello there, I have worked on java wrapper of tessaract api. I can show you my app which takes image as input and returns text. I have also experience of goolge vision api. Ping me Relevant Skills and Experience Java Más

$200 USD en 3 días
(9 comentarios)
3.2
GITTechBAY

A proposal has not yet been provided

$244 USD en 3 días
(9 comentarios)
2.4