I will automate your data extraction and CSV cleaning with python
Data Engineer
About this Gig
Hi, Im César. I'm an engineer with 3+ years of experience building data systems and automating workflows for government and infrastructure clients.
I don't just run basic scraper tools; I write custom Python scripts to solve messy data problems. As an example, in a recent project, I wrote a script that pulled mapped contact data from over 1,000 unstructured legal PDFs in under 4 minutesa task that previously took a team 60+ manual hours.
Here is what I can build for you:
- Custom Data Extraction: Pulling clean data from websites, even if they have logins, pagination, or dynamic JavaScript.
- Data Cleaning & Formatting: Taking your messy CSVs, Excel files, or PDFs and using Pandas to filter, deduplicate, and format them exactly how you need them.
- ETL Pipelines: Moving raw data from any source into clean, structured outputs (CSV, JSON, SQL).
- Automated Scripts: I can deliver the fully documented Python source code so you can run the extraction yourself whenever you want.
Every script I deliver is modular, heavily commented, and built to handle errors without crashing.
Important: Please send me a quick message with your target URL or sample file before placing an order. I like to scope eve
Technology:
Excel
•
Google Sheets
•
Python
•
Zapier
FAQ
What file formats can you work with?
I can process CSV, Excel (.xlsx/.xls), JSON, PDF, Word (.docx), and data from websites or APIs. If you have a different format, message me — I'll let you know if I can handle it.
Do I get the Python script, or just the cleaned data?
The Basic package delivers the cleaned output only. Standard and Premium packages include the documented source code so you can re-run the automation yourself anytime.
How large can my dataset be?
Basic handles up to 500 rows. Standard up to 5,000 rows. For datasets larger than 5,000 rows or requiring database integration, choose Premium or message me for a custom quote.

