Languages: Java or PHP
I'm looking for a programmer to develop in Java or PHP - to scrape and import court case records from webpages and save them in a csv.
The court records page is accessed after solving reCAPCHA. Bypassing the CAPCHA vis manual input of sessionid is acceptable.
The program will text extract the following fields for each cases and save them in the output csv:
This is the court website that will be used for the scraping:
[url removed, login to view]
Already have partially working code done in Java and JSoup using a valid session id hard coded. Successfully scraping multiple records. We need to refine and improve by adding page forward (next page) function and improve/rewrite existing scraping code. The code will be provided to save time.
https://eapps.courts.state.va.us/gdcourts/ is the URL
Please note that there are many courts (circuit, appeals, supream) but this project only covers the "general district courts"
Please note there are many general district courts. The data format / web access is all the same but the script must pass the court parameter variablei to select which court we are working with. This can be a parameter passed at the run time or a variable within the script.
Also, there will be three CSV files generated. (1) The main case file; (2) Hearing Information records uniquely identified by the Case_Number (one to many relationship to the main case file; (3) Service/Process records uniquely identified by the Case_Number (one to many relationship to the main case file.
In summary, There can be multiple "Hearing Information records" and/or "Service/Process records" for each case records. Therefore, we need for or while loop for each case record to extract one to many records for #2 and #3 records type discussed above.
42 freelancers están ofertando el promedio de $533 para este trabajo
Hi sir, i have been scraping for many years, PHP is not suitable for scraping but Java is awesome, i just want to know that you need the program or one the data ? i write script in C#. Waiting for the response.
I have already completed web scrapping using Java API. So I can complete this with high quality. Java API finalization is already completed from my side