Название книги: Скрапинг веб-сайтов с помощью Python
Год: 2016
Автор: Райан Митчелл
Страниц: 238
Язык: Английский
Формат: pdf, mobi, epub, fb2
Размер: 6.5 Мб, 3.1 Мб, 3.1 Мб, 11.0 Мб
Описание книги “Скрапинг веб-сайтов с помощью Python”:
To those who have not developed the skill, computer programming can seem like a kind of magic. If programming is magic, then web scraping is wizardry; that is, the application of magic for particularly impressive and useful – yet surprisingly effortless – feats.
In fact, in my years as a software engineer, I’ve found that very few programming practices capture the excitement of both programmers and laymen alike quite like web scraping. The ability to write a simple bot that collects data and streams it down a terminal or stores it in a database, while not difficult, never fails to provide a certain thrill and sense of possibility, no matter how many times you might have done it before.
It’s unfortunate that when I speak to other programmers about web scraping, there’s a lot of misunderstanding and confusion about the practice. Some people aren’t sure if it’s legal (it is), or how to handle the modern Web, with all its JavaScript, multimedia, and cookies. Some get confused about the distinction between APIs and web scrapers.
This book seeks to put an end to many of these common questions and misconceptions about web scraping, while providing a comprehensive guide to most common web-scraping tasks.
This book is designed to serve not only as an introduction to web scraping, but as a comprehensive guide to scraping almost every type of data from the modern Web Although it uses the Python programming language, and covers many Python basics, it should not be used as an introduction to the language.
Оглавление:
Preface
- Your First Web Scraper
- Advanced HTML Parsing
- Starting to Crawl
- Using APIs
- Storing Data
- Reading Documents
- Cleaning Your Dirty Data
- Reading and Writing Natural Languages
- Crawling Through Forms and Logins
- Scraping JavaScript
- Image Processing and Text Recognition
- Avoiding Scraping Traps
- Testing Your Website with Scrapers
- Scraping Remotely
Appendix A: Python at a Glance
Appendix B: The Internet at a Glance
Appendix C: The Legalities and Ethics of Web Scraping
Index
Скачать книгу “Скрапинг веб-сайтов с помощью Python”