We Are Web Scraping and Data Analysis Experts

Use intoli to source and understand the data that powers your core business. We have the skills and experience necessary to identify the hidden structure in your data and choose the right approach and the right tools for any scenario.

Let's Talk

Advanced Web Scraping

Running into issues with CAPTCHAs, JavaScript, or rate limiting? We can help you get the data you need quickly and reliably.

Historical Data

Interested in tracking trends and changes over time? We can provide API access to historically scraped data, and can even scrape retroactively from cached sources .

Data Insights

Are you having trouble extracting actionable insights from your data? We can help you select and implement the right machine learning algorithm for your needs.

Data Pipelines

We can help you make important architectural decisions and set up your data infrastructure. We will help your team avoid common and costly pitfalls.

Full Stack Development

We are experts in a wide range of web technologies, and can integrate our scrapers with existing projects on the frontend, in an extension, or on the server.

Quick Turnaround

We’ll work with you to schedule milestones, and get you the data you need quickly and efficiently.

Testimonials

We’ve worked with many clients and we do everything we can to make sure they’re happy with the results.
Have a look at what some of them have said about us.

Looking to solve your data needs?

Let us know what you're working on and we'll be happy to help you find the best solution.

Get Started

From our blog

Check out all the cool stuff we do.

Building Data Science Pipelines with Luigi and Jupyter Notebooks

By Mattia Ciollaro on November 28, 2017

Learn about the Luigi task runner and how to use Jupyter notebooks in your workflows.

Continue reading

Dangerous Pickles — Malicious Python Serialization

By Evan Sangaline on October 17, 2017

A light introduction to the Python pickle protocol, the Pickle Machine, and constructing malicious pickles.

Continue reading

A Brief Tour of Grouping and Aggregating in Pandas

By Andre Perunicic on October 13, 2017

Learn how to use pandas to easily slice up a dataset and quickly extract useful statistics.

Continue reading

Designing The Wayback Machine Loading Animation

By Evan Sangaline on October 11, 2017

A walkthrough of how we helped The Internet Archive design a new loading animation for the Wayback Machine.

Continue reading

Our Clients

Meet the Team

We’ve been good friends and developing code together for twelve years.
Find out what makes us the perfect team to help you meet your business needs.

Evan Sangaline, PhD

Evan has been an avid programmer for 19 years and has shipped projects in over a dozen languages. His career began in experimental higher energy physics where he managed distributed computing infrastructures and performed award winning research on particle identification. This work included the development of a ground breaking unsupervised machine learning technique that significantly outperformed all existing approaches. He later switched fields to statistics where he developed the strongly intensive cumulants and made the first Bayesian determination of the nuclear equation of state using advanced statistical techniques designed to accomodate otherwise prohibitively expensive models.

Since leaving academia, he has founded a startup that used artificial intelligence to make video games more fun, written technical articles that hundreds of thousands of people have enjoyed, and helped numerous companies build their products or meet their data needs.

Andre Perunicic, PhD

After getting his Ph.D. in math, Andre spent two years working as a postdoc at research institutions in Canada. His academic work centered on applying ideas from mathematical physics and string theory to number theory, and he developed techniques for greatly simplifying certain extremely labor intensive calculations.

His mathematical training and life-long programming experience allowed for an easy transition to industry, where he has helped multiple teams meet their business and data science needs. He worked on desktop and web applications, as well as data science projects, and has a detailed understanding of machine learning algorithms and techniques.

Before Intoli, he most recently worked in the data science department of Spreemo Health, where he used Bayesian techniques to define analytical metrics used to measure quality of radiology services. He helped identify key predictive factors for high quality MRI exams, and demonstrated drastic differences amongst various radiology providers.