We are working with a company that creates key, actionable insights that help litigators win more motions, win more cases, and win more business. Their patented, AI-powered analysis of tens of millions of state court litigation documents uncovers hidden patterns in the behavior of judges, law firms, lawyers, litigants, and more, coast to coast. It is a tool built by litigators, for litigators. This company is a mid-stage startup that’s growing up. They’ve been around long enough to be stable as a company and have secured their position as a keystone resource in the world of legal analytics. Data is the heart of this business, and we’re looking for experienced talent to help build durable, scalable processes and help the team level up.
What you’ll do:
- Oversee our data collection team, write new crawlers, and ensure accurate, daily collection of data from hundreds of court websites
- Understanding of object-oriented programming
- Ability to write clean, documented code (PEP 8 standard preferred)
- Familiarity with HTML, HTTP requests, Amazon Web Services, Linux, and regular expressions
- Experience working with CAPTCHAs and IP rotation
- Knowledge of version control software (GitHub)
- Critical thinking and troubleshooting skills

What you’ll bring:
- 5+ years of experience designing and delivering large-scale, 24/7 crawl infrastructure and pipelines
- Proven experience designing, operating, and improving crawlers
- Excellent Python programming skills
- Strong understanding of AWS, including S3 buckets and Lambda functions
- Experience with Dexi or similar crawling software
- Ability to thrive in an agile, entrepreneurial start-up environment
- Comfortable working in a Unix (Linux or Mac) environment
- Comfortable in a version-controlled code environment
- Advanced knowledge of data acquisition technologies such as web crawling and PDF and data extraction

This role is open only to candidates from Argentina, Brazil, Chile, Colombia, Costa Rica, Ecuador, El Salvador, Mexico, Panama, Peru, and Uruguay.
Intermediate or advanced spoken English is required for ALL opportunities. If you can't speak English yet, please keep practicing and apply in the future.