Desktop (or web) app: scrape PDF, OCR, Tesseract, Django, Poppler
$30-250 USD
Paid on delivery
Desktop (or web) app to scrape PDFs available at a URL and save the extracted text into a column of a MySQL/PostgreSQL database.
Extractor module (this can be a desktop app or a web app):
1) I have a database of URL links.
2) Fetch the PDF from each URL link from step 1.
3) Extract text (sample file attached) from the fetched PDFs.
4) Store the extracted text in PostgreSQL (preferred) or MySQL.
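The four steps above could be sketched roughly as follows. This is only an illustration under several assumptions, not a finished design: the table name `pdf_links` and its columns `id`, `url` and `extracted_text` are hypothetical, `sqlite3` stands in for PostgreSQL so the sketch runs without a database server, and text extraction shells out to Poppler's `pdftotext`, which must be installed separately. Scanned PDFs without a text layer would additionally need an OCR step (for example Tesseract), which is not shown here.

```python
import sqlite3
import subprocess
import tempfile
import urllib.request
from typing import Callable


def fetch_pdf(url: str, timeout: float = 30.0) -> bytes:
    """Step 2: download the raw PDF bytes from a URL."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return response.read()


def extract_text(pdf_bytes: bytes) -> str:
    """Step 3: extract text with Poppler's pdftotext CLI.

    Assumes a POSIX system, where another process can read the
    temporary file by name while it is still open here.
    """
    with tempfile.NamedTemporaryFile(suffix=".pdf") as tmp:
        tmp.write(pdf_bytes)
        tmp.flush()
        # "-" as the output file sends the text to stdout.
        result = subprocess.run(
            ["pdftotext", "-layout", tmp.name, "-"],
            capture_output=True,
            check=True,
        )
    return result.stdout.decode("utf-8", errors="replace")


def process_links(
    conn: sqlite3.Connection,
    fetch: Callable[[str], bytes] = fetch_pdf,
    extract: Callable[[bytes], str] = extract_text,
) -> int:
    """Steps 1 and 4: read pending URLs, store the extracted text.

    Returns the number of rows that were filled in.
    """
    rows = conn.execute(
        "SELECT id, url FROM pdf_links "
        "WHERE extracted_text IS NULL ORDER BY id"
    ).fetchall()
    done = 0
    for link_id, url in rows:
        try:
            text = extract(fetch(url))
        except Exception as exc:
            # One bad link or PDF should not stop the whole run.
            print(f"skipped {url}: {exc}")
            continue
        conn.execute(
            "UPDATE pdf_links SET extracted_text = ? WHERE id = ?",
            (text, link_id),
        )
        conn.commit()
        done += 1
    return done
```

Passing `fetch` and `extract` as parameters keeps the loop testable without network access or Poppler. For PostgreSQL the same queries should carry over with a driver such as psycopg, where the placeholder is `%s` instead of `?`.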
Provide some basic selection criteria for testing with smaller loads.
Provide a lag input (in seconds) so that the PDF source website is not overloaded by our fetches.
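One possible shape for these two inputs, as a minimal sketch: the function names `select_links` and `throttled`, and the choice of a substring filter plus a row limit as the "selection criteria", are assumptions made for illustration. The posting does not specify which criteria are wanted.

```python
import time
from typing import Callable, Iterable, Iterator, Optional, Tuple

Link = Tuple[int, str]  # (id, url)


def select_links(
    links: Iterable[Link],
    contains: Optional[str] = None,
    limit: Optional[int] = None,
) -> Iterator[Link]:
    """Yield only links whose URL contains a substring, up to a limit."""
    count = 0
    for link_id, url in links:
        if contains is not None and contains not in url:
            continue
        if limit is not None and count >= limit:
            break
        count += 1
        yield link_id, url


def throttled(
    items: Iterable[Link],
    lag_seconds: float,
    sleep: Callable[[float], None] = time.sleep,
) -> Iterator[Link]:
    """Yield items, pausing lag_seconds between consecutive items."""
    first = True
    for item in items:
        if not first:
            sleep(lag_seconds)
        first = False
        yield item
```

A small test run could then iterate over `throttled(select_links(all_links, contains="example.com", limit=10), lag_seconds=5)` and fetch one PDF per iteration, so that at most ten requests are made, at least five seconds apart.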
--> Do you have examples of similar projects that you have worked on? How can I verify them?
--> If you are in a different time zone, are you available for a 7 am US Pacific time meeting?
--> What are your hourly rates for tasks beyond the requirements in this job post?
--> What technologies will you use for the scraping module?
I need at least 2 business days for testing once it is bug-free, after which I will release 80% of the payment.
The remaining 20% will be released after an additional 5 business days of testing.
Project ID: #15484456
About the project
9 freelancers are bidding an average of $198 for this job
Hi there, I can make you a desktop app which will do this job. I have strong knowledge of Tesseract OCR, but can't guarantee 100% accuracy, as fonts and sizes differ in the attached PDFs. Message me. Relevant Skills and …
I am an IITK graduate and I have 9 years of experience in software development. I have a 100% completion rate and have finished all my projects with the highest level of customer satisfaction. Relevant Skills and Exp…
I have experience in OCR, Python and PDF manipulation. The task will be completed by the proposed deadline. Relevant Skills and Experience: I have more than 2 years of experience in OCR and Python. Proposed Milestones …