Data Collection Engineer

Midlevel
Remote

Questions?   Contact us!

Company

We are working with a company that creates key, actionable insights that help litigators win more motions, win more cases, and win more business. They've patented, AI-powered analysis of tens of millions of state court litigation documents uncovering hidden patterns of behavior of judges, law firms, lawyers, litigants, and more, coast-to-coast. A tool built by litigators, for litigators. This company is a mid-term startup that’s growing up. They’ve been around long enough to be stable as a company and have secured their position as a keystone resource in the world of legal analytics. Data is the heart of this business and we’re looking for experienced talent to help provide durable and scalable processes and level-up.

Job Description

What you’ll do: - Oversee our data collection team, writing new crawlers, and ensuring accurate and daily collection of data from hundreds of court websites - Understanding of object-oriented programming - Ability to write clean and documented code (PEP8 standard preferred) - Familiarity with HTML, HTTP requests, Amazon Web Services, Linux, regular expressions - Experience working with CAPTCHA and IP rotation - Knowledge of version control software (Github) - Critical thinking and troubleshooting skills What you’ll bring: - 5+ years of experience designing and delivering large scale, 24/7 crawl infrastructure and pipelines - Proven experience designing, operating, and improving crawlers - Excellent Python programming skills - Strong understanding of AWS including S3 buckets and Lambdas - Experience with Dexi or other similar crawling software - Ability to thrive in an agile, entrepreneurial start-up environment - Comfortable working in a Unix (Linux or Mac) environment - Comfortable in a version-controlled code environment - Advanced knowledge of data acquisition technologies such as web crawling, PDF and data extraction

Notes

Only candidates from Argentina, Brazil, Chile, Colombia, Costa Rica, Ecuador, El Salvador, Mexico, Panama, Peru, Uruguay

Intermediate or advanced spoken English is required for ALL opportunities. If you can't speak English yet, please keep practicing and apply in the future.

Already have an account?

Sign In to load your details

Apply now

Please complete our online questionnaire

Upload your photo
First Name (Required)
Last Name (Required)
Email (Required)
Summary (Required)
LinkedIn Profile (Required)
GitHub Profile (Optional)
Stack Overflow Profile (Optional)
Portfolio (Optional)
English Resume (CV) (Required) Under 5MB please. English. PDF only.
Availability (Required)
Full-time Monthly Salary Expectations (in USD) (Required)
Country (Required)
Are you able to hold an interview in English? (Required)
On a scale of 1-10: what is your level of spoken English? (Required)
Required Skills (Experience in years) (Required)
How many years of your career have you worked remotely? (Required)
How many years have you worked professionally in software? (Required)