Summary: This role involves designing and developing scalable web crawling and data extraction systems that collect large volumes of web data.