Название книги: Скрапинг веб-сайтов с помощью Python
Автор: Райан Митчелл
Формат: pdf, mobi, epub, fb2
Размер: 6.5 Мб, 3.1 Мб, 3.1 Мб, 11.0 Мб
Описание книги «Скрапинг веб-сайтов с помощью Python»:
To those who have not developed the skill, computer programming can seem like a kind of magic. If programming is magic, then web scraping is wizardry; that is, the application of magic for particularly impressive and useful — yet surprisingly effortless — feats.
In fact, in my years as a software engineer, I’ve found that very few programming practices capture the excitement of both programmers and laymen alike quite like web scraping. The ability to write a simple bot that collects data and streams it down a terminal or stores it in a database, while not difficult, never fails to provide a certain thrill and sense of possibility, no matter how many times you might have done it before.
This book seeks to put an end to many of these common questions and misconceptions about web scraping, while providing a comprehensive guide to most common web-scraping tasks.
This book is designed to serve not only as an introduction to web scraping, but as a comprehensive guide to scraping almost every type of data from the modern Web Although it uses the Python programming language, and covers many Python basics, it should not be used as an introduction to the language.
- Your First Web Scraper
- Advanced HTML Parsing
- Starting to Crawl
- Using APIs
- Storing Data
- Reading Documents
- Cleaning Your Dirty Data
- Reading and Writing Natural Languages
- Crawling Through Forms and Logins
- Image Processing and Text Recognition
- Avoiding Scraping Traps
- Testing Your Website with Scrapers
- Scraping Remotely
Appendix A: Python at a Glance
Appendix B: The Internet at a Glance
Appendix C: The Legalities and Ethics of Web Scraping
Скачать книгу «Скрапинг веб-сайтов с помощью Python»