TY  - BOOK
AU  - Mitchell, Ryan
TI  - Web scraping with Python: collecting data from the modern web
SN  - 9789352131457
AV  - QA76.73.P98 M58 2015
U1  - 005.133 MIT 23
PY  - 2015///
CY  - Sebastopol, CA
PB  - O'Reilly Media
KW  - Python (Computer program language)
KW  - Data mining
KW  - Automatic data collection systems
N1  - Includes index; Part I. Building scrapers: Your first web scraper -- Advanced HTML parsing -- Starting to crawl -- Using APIs -- Storing data -- Reading documents. Part II. Advanced scraping: Cleaning your dirty data -- Reading and writing natural languages -- Crawling through forms and logins -- Scraping JavaScript -- Image processing and text recognition -- Avoiding scraping traps -- Testing your website with scrapers -- Scraping remotely -- Python at a glance -- The internet at a glance -- The legalities and ethics of web scraping
N2  - Learn web scraping and crawling techniques to access data from any web source in any format. Teaches basic web scraping mechanics, but also delves into more advanced topics, such as analyzing raw data or using scrapers for frontend website testing.
ER  - 