Scrapy 1.2.1 released: a web crawling framework

Scrapy 1.2.1 has been released.

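Scrapy is an asynchronous crawling framework built on Twisted and implemented in pure Python. Users only need to customize a few modules to build a crawler that scrapes web page content and images.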

Changes: New features

New FEED_EXPORT_ENCODING setting to customize the encoding used when writing items to a file. This can be used to turn off \uXXXX escapes in JSON output. It is also useful for those who want something other than UTF-8 for XML or CSV output (#2034).
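To illustrate the escaping behavior this setting controls, here is a minimal sketch. The setting name comes from the note above; the value "utf-8" and the sample item are assumptions chosen for illustration. The standard-library `json` calls only mimic the difference between escaped and unescaped output; they are not Scrapy's exporter.

```python
import json

# In a project's settings.py one would presumably write (the value is an assumption):
FEED_EXPORT_ENCODING = "utf-8"

# Hypothetical item containing non-ASCII text.
item = {"title": "爬虫"}

# ASCII-only output: non-ASCII characters are written as \uXXXX escapes.
escaped = json.dumps(item, ensure_ascii=True)

# Unescaped output, encoded with the chosen encoding before being written to a file.
readable = json.dumps(item, ensure_ascii=False)
encoded = readable.encode(FEED_EXPORT_ENCODING)

print(escaped)
print(readable)
```

Both strings decode to the same item; only the on-disk representation differs.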

startproject command now supports an optional destination directory to override the default one based on the project name (#2005).

New SCHEDULER_DEBUG setting to log request serialization failures (#1610).

JSON encoder now supports serialization of set instances (#2058).
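The standard `json` module refuses to serialize sets, which is why encoder support matters. The sketch below shows one way such support can be added; `SetAwareEncoder` is a hypothetical name and this is not Scrapy's own encoder, only an illustration of the technique under the assumption that the set elements are sortable.

```python
import json


class SetAwareEncoder(json.JSONEncoder):
    """Hypothetical encoder that serializes sets as JSON arrays."""

    def default(self, o):
        if isinstance(o, (set, frozenset)):
            # Sort for stable output; assumes the elements are mutually comparable.
            return sorted(o)
        # Defer to the base class, which raises TypeError for unsupported types.
        return super().default(o)


item = {"tags": {"python", "crawler"}}
print(json.dumps(item, cls=SetAwareEncoder))  # {"tags": ["crawler", "python"]}
```

Without the custom encoder, `json.dumps(item)` raises `TypeError` for the same input.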

Interpret application/json-amazonui-streaming as TextResponse (#1503).

scrapy is imported by default when using shell tools (shell, inspect_response) (#2248).

Bug fixes

DefaultRequestHeaders middleware now runs before UserAgent middleware (#2088). Warning: this is technically backwards incompatible, though we consider this a bug fix.
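Why the order matters can be sketched with plain dictionaries. The two functions below are hypothetical stand-ins, not Scrapy code, and they assume that each middleware only fills in headers that are not already set; under that assumption, whichever runs first decides the User-Agent value.

```python
def apply_default_headers(headers, defaults):
    """Stand-in for DefaultRequestHeaders: fill in only the missing headers."""
    for name, value in defaults.items():
        headers.setdefault(name, value)


def apply_user_agent(headers, user_agent):
    """Stand-in for UserAgent: set User-Agent only if it is missing."""
    headers.setdefault("User-Agent", user_agent)


# Hypothetical configuration values.
defaults = {"Accept": "text/html", "User-Agent": "from-default-headers"}
user_agent = "from-user-agent-setting"

# New order: default headers first, then the user agent.
new_order = {}
apply_default_headers(new_order, defaults)
apply_user_agent(new_order, user_agent)

# Old order: user agent first, then default headers.
old_order = {}
apply_user_agent(old_order, user_agent)
apply_default_headers(old_order, defaults)

print(new_order["User-Agent"])  # from-default-headers
print(old_order["User-Agent"])  # from-user-agent-setting
```

Projects that define a User-Agent in both places may therefore send a different header after upgrading, which is the backwards incompatibility the warning refers to.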

HTTP cache extension and plugins that use the .scrapy data directory now work outside projects (#1581). Warning: this is technically backwards incompatible, though we consider this a bug fix.

Selector does not allow passing both response and text anymore (#2153).

Fixed logging of wrong callback name with scrapy parse (#2169).

Fix for an odd gzip decompression bug (#1606).

Fix for selected callbacks when using CrawlSpider with scrapy parse (#2225).

Fix for invalid JSON and XML files when spider yields no items (#872).

Implement flush() for StreamLogger, avoiding a warning in logs (#2125).
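An object that stands in for a stream is expected to offer flush() as well as write(), because callers may flush it at any time. The following is a minimal sketch of a file-like wrapper that forwards lines to a logger; `LineLogger` and the logger name are hypothetical, and this is not Scrapy's StreamLogger.

```python
import io
import logging


class LineLogger:
    """Hypothetical file-like object that forwards written lines to a logger."""

    def __init__(self, logger, level=logging.INFO):
        self.logger = logger
        self.level = level

    def write(self, buf):
        # Emit one log record per line of the written text.
        for line in buf.rstrip().splitlines():
            self.logger.log(self.level, line.rstrip())

    def flush(self):
        # Callers that treat this object as a stream may call flush();
        # delegate to the logger's handlers so the call succeeds.
        for handler in self.logger.handlers:
            handler.flush()


# Capture log output in memory so the example has no side effects on disk.
captured = io.StringIO()
logger = logging.getLogger("stream_logger_sketch")
logger.setLevel(logging.INFO)
logger.propagate = False
logger.addHandler(logging.StreamHandler(captured))

stream = LineLogger(logger)
stream.write("first line\nsecond line\n")
stream.flush()
print(captured.getvalue())
```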

Refactoring

canonicalize_url has been moved to w3lib.url (#2168).

Download link:
