[工具资源] CSDN 已下载资源自动批量评论脚本

用 python 实现自动批量打分评论指定 CSDN 账号内所有下载过待评论的资源。

GitHub 仓库地址： https://github.com/mzlogin/csdncommenter

可通过 pip 安装运行：

pip install csdncommenter csdncommenter 背景

CSDN 账号过一段时间就会累积几十个下载过但是未评论打分的资源，虽然现在上传了一些资源供别人下载后基本不愁积分，但是为了可持续发展，还是把评论一下就能顺手拿了的这种积分不客气地收入囊中吧！不过手动一个一个去评论真的很蛋疼……特别是 CSDN 还搞了个两个评论间隔不能小于 60 秒的限制，评论几十个就得至少花个几十分钟折腾，所以想想这种耗时、无脑的活还是交给程序来完成吧。

对于这类模拟 HTTP 请求然后可能频繁用到页面解析和正则表达式之类的活，用 C++ 写还是有点蛋疼的，用我那半生不熟的 Python 练练手正合适。

遂在 GitHub 上建了个仓库开工，地址在这里： https://github.com/mzlogin/csdncommenter 。

分析

使用 Fiddler 把登录 - 到待评论页面 - 评论的完整流程抓了一下，整理程序逻辑大致如下：

注：如下 HTTP 请求均使用同一个 SESSION。

手动输入 CSDN 的用户名和密码。

用 GET 方法从 https://passport.csdn.net/account/login 页面获取 lt 、 execution 和 _eventId 等参数。

将第 1 步中的用户名和密码，还有第 2 步中得到的参数 POST 给 https://passport.csdn.net/account/login ，从 Response 中判断是否登录成功――我采用的依据是 status_code 为 200 且 Reponse 内容中有 lastLoginIP 。

用 GET 方法从 http://download.csdn.net/my/downloads 页面获取已下载资源总页数。从最后一个 pageliststy 的 href 中得到。

根据第 4 步中得到的总页数，根据每个页面 num 拼得 url 为 http://download.csdn.net/my/downloads/num ，使用 GET 方法访问之拿到该页面中所有待评论资源 ID。从所有 class="btn-comment" 的 a 标签的 href 中得到。

对第 5 步中得到的所有待评论资源 ID 依次进行间隔至少 60S 的打分评论，随机打出 1 到 5 星，对应一句英文短句评论。出乎我意料的是评论这一步竟然也是用 GET 就可以做， http://download.csdn.net/index.php/comment/post_comment 后面带上 sourceid 、 content （评论内容）、 rating （打分）和 t （时间戳）参数就可以。评论成功会返回 ({"succ":1}) ，失败会返回「两次评论需要间隔 60 秒」、「您已经发表过评论」等之类的 msg 。

最终运行截图如下：

确认这种方式能有效拿到 CSDN 的分数：

总结用 Python 干这种类型的活还是很有优势的，requests 和 BeautifulSoup 简直神器啊！我那点蹩脚的 Python 底子之所以能还比较顺利地把这个流程写下来，实际上也得亏 CSDN 对请求的验证相对较松，比如像我代码里那样写， User-Agent 是带有 Python 字样的，而且很显然不是浏览器在访问，但 CSDN 并未对此作限制。源码

没有找到从 Github Pages 引用 Github 仓库里的源码的方法

，所以把 py 文件放到一个 gist 里了，引用如下

：

（Gist 前几天被伟大的墙封了，还是直接贴上代码吧。2014/11/5 update）

（GitHub 仓库： mzlogin/csdncommenter ，现在可以通过 pip 安装使用了 pip install csdncommenter 然后 csdncommenter 。2015/10/27 update）

# File : CsdnCommenter.py # Author : Zhuang Ma # E-mail : ChumpMa(at)gmail.com # Website: http://www.mazhuang.org # Date : 2015-05-17 import requests from BeautifulSoup import BeautifulSoup import getpass import time import random import re import urllib import traceback class CsdnCommenter(): """Csdn operator""" def __init__(self): self.sess = requests.Session() def login(self): """login and keep session""" username = raw_input('username: ') password = getpass.getpass('password: ') url = 'https://passport.csdn.net/account/login' html = self.getUrlContent(self.sess, url) if html is None: return False soup = BeautifulSoup(html) lt = self.getElementValue(soup, 'name', 'lt') execution = self.getElementValue(soup, 'name', 'execution') _eventId = self.getElementValue(soup, 'name', '_eventId') data = { 'username' : username, 'password' : password, 'lt' : lt, 'execution' : execution, '_eventId' : _eventId } response = None try: response = self.sess.post(url, data) except: # traceback.print_exc() pass return self.isLoginSuccess(response) def autoComment(self): """main handler""" if self.getSourceIds() is False: print 'No source can comment!' return print 'Total %d source(s) wait for comment.' % len(self.sourceids) nhandled = 0 for sourceid in self.sourceids: left = len(self.sourceids) - nhandled sec = random.randrange(61,71) print 'Wait %d seconds for start. %s source(s) left.' % (sec, left) time.sleep(sec) self.comment(sourceid) nhandled += 1 print 'Finished!' def getSourceIds(self): """get source ids wait for comment""" self.sourceids = set() pagecount = self.getPageCount() if pagecount == 0: return False print 'Pagecount is %d.' % pagecount pattern = re.compile(r'.+/(\d+)#comment') for n in range(1, pagecount + 1): url = 'http://download.csdn.net/my/downloads/%d' % n html = self.getUrlContent(self.sess, url) if html is None: continue soup = BeautifulSoup(html) sourcelist = soup.findAll('a', attrs={'class' : 'btn-comment'}) if sourcelist is None: continue for source in sourcelist: href = source.get('href', None) if href is not None: rematch = pattern.match(href) if rematch is not None: self.sourceids.add(rematch.group(1)) return len(self.sourceids) > 0 def getPageCount(self): """get downloaded resources page count""" url = 'http://download.csdn.net/my/downloads' html = self.getUrlContent(self.sess, url) if html is None: print 'Get pagecount failed' return 0 soup = BeautifulSoup(html) pagelist = soup.findAll('a', attrs={'class' : 'pageliststy'}) if pagelist is None: return 0 lasthref = pagelist[len(pagelist) - 1].get('href', None) if lasthref is None: return 0 return int(filter(str.isdigit, str(lasthref))) def comment(self, sourceid): """comment per source""" print 'sourceid %s commenting...' % sourceid contents = [ 'It just soso, but thank you all the same.', 'Neither good nor bad.', 'It is a nice resource, thanks for share.', 'It is useful for me, thanks.', 'I have looking this for long, thanks.' ] rating = random.randrange(1,6) content = contents[rating - 1] t = '%d' % (time.time() * 1000) paramsmap = { 'sourceid' : sourceid, 'content' : content, 'rating' : rating, 't' : t } params = urllib.urlencode(paramsmap) url = 'http://download.csdn.net/index.php/comment/post_comment?%s' % params html = self.getUrlContent(self.sess, url) if html is None or html.find('({"succ":1})') == -1: print 'sourceid %s comment failed! response is %s.' % (sourceid, html) else: print 'sourceid %s comment succeed!' % sourceid @staticmethod def getElementValue(soup, element_name, element_value): element = soup.find(attrs={element_name : element_value}) if element is None: return None return element.get('value', None) @staticmethod def isLoginSuccess(response): if response is None or response.status_code != 200: return False return -1 != response.content.find('lastLoginIP') @staticmethod def getUrlContent(session, url): html = None try: response = session.get(url) if response is not None: html = response.text except requests.exceptions.ConnectionError as e: # traceback.print_exc() pass return html def main(): csdn = CsdnCommenter() while csdn.login() is False: print 'Login failed! Please try again.' print 'Login succeed!' csdn.autoComment() if __name__ == '__main__': main()

[工具资源] CSDN 已下载资源自动批量评论脚本

Trending Articles

[奇怪机翻组] 双梦相牵 / ふたりの夢もち [RJ01259078] [WebRip] [1080P HEVC-10Bit AAC 2.0]...

HONDA CITY VTI-S 菜單分享

#新闻拍一拍# 新的摩尔定律：黄氏定律

一如既往的痴情能否打动月瓶金蝎？ (豆瓣月亮水瓶小组)

求購按摩椅~'~

「粉红」不是霸凌辜莞允杠部落客：我爽在哪？

Intel 7-10代集成显卡驱动31.0.101.2137完整版

涉Gotbit加密货币市场操纵台男纽约被捕

臺灣法治會計學會2025年第三季研討會

不靠姊姊！張柏芝弟弟開計程車維生

关门一家亲：习远平、张澜澜、徐才厚

剑指offer——24.二叉树中和为某一值的路径

苏珊米勒日晕05.11｜狮子鼓励孩子；处女相信自己 (豆瓣 SUSAN MILLER小组)

【台積電IT卓越新戰略5】台積IT組織5年三次大調整，要靠平臺工程讓DevOps創新再加速

【日语无字】春之钟.Haru.no.kane.1985.JAP.vhsrip.NoSub.by.xiongzaixia&vivi

美籍老公不讓步李愛綺兒子念公立小學

新华网这张照片绝了!直讽江泽民宋祖英淫乱组图

湖州师范学院音乐学院开发的 Kontakt 8 明代魏氏乐琵琶/瑟/月琴音源即将发布

Google Chrome Portable 140.0.7339.186 穩定版免安裝中文版 - Google 瀏覽器

免费翻墙节点大全