Python urllib2: Study Notes on Scraping WordPress Usernames

Date: 2014-11-10  Editor: 简简单单  Source: 一聚教程网

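I have only just started learning Python, so I tried it out on a practical example. The script requests /?author=N on a WordPress site for a range of author IDs and appends the usernames it finds to a text file named after the site.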

The code is as follows:

#!/usr/bin/env python
#coding=utf-8
import sys
import urllib2
import re
import os
demo = """

"""
try:
    if len(sys.argv) < 2:
        print "----------------------------------------"
        print demo
        print "----------------------------------------"
        sys.exit()
    lj = "/?author="
    url = sys.argv[1] + lj
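    path = os.getcwd() # current working directory, where the output file is written
    url_name = re.findall(r'https?://(.*\w)',sys.argv[1]) # extract the domain name (http or https)
    url_name_txt = url_name[0] + '.txt'
    print "Usernames will be written to %s" % os.path.join(path,url_name_txt)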
    for i in xrange(1,1111): # loop over author IDs 1 to 1110
        web = url + str(i)
        code = urllib2.urlopen(web).read() # fetch the page source
        # Regex match for the username. NOTE: this pattern looks truncated in the
        # source (markup was probably stripped), and the unescaped | also matches
        # the empty string.
        username = re.findall('(.*\w) |',code)
        with open(url_name_txt,'a') as username_txt: # append mode; closed automatically
            username_txt.writelines(username)
            username_txt.write("\n")
except Exception as e: # unlike a bare except, this does not swallow sys.exit()
    print 'Just for wp.py: %s' % e

Result screenshot: https://img.111com.net/get_pic/2014/11/10/20141110153926748.png
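
The script above targets Python 2. urllib2 does not exist in Python 3, where its functionality lives in urllib.request. Below is a minimal Python 3 sketch of the same idea, not the author's code. The function names (author_url, output_filename, extract_username, fetch, enumerate_users) are hypothetical, and the title pattern is an assumption: it supposes the author page's title has the form "name | Site Name", which varies between themes. Unlike the original, it takes only the host name for the output file and skips IDs that return an HTTP error instead of stopping.

```python
import re
import urllib.error
import urllib.request
from urllib.parse import urlparse


def author_url(base_url, author_id):
    """Build the /?author=N URL that the original script requests."""
    return base_url.rstrip("/") + "/?author=" + str(author_id)


def output_filename(base_url):
    """Derive '<host>.txt' from the site URL (works for http and https)."""
    return urlparse(base_url).netloc + ".txt"


# Assumption: the page title looks like "name | Site Name".
# [^<|] keeps the match inside the <title> element.
TITLE_RE = re.compile(r"<title>\s*([^<|]*?)\s*\|", re.IGNORECASE)


def extract_username(html):
    """Return the text before the first '|' in <title>, or None."""
    match = TITLE_RE.search(html)
    return match.group(1) if match else None


def fetch(url, timeout=10):
    """Fetch a page as text; return None on HTTP errors such as 404."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return resp.read().decode("utf-8", errors="replace")
    except urllib.error.HTTPError:
        return None


def enumerate_users(base_url, first_id=1, last_id=10):
    """Yield (author_id, username) for each ID whose page yields a name."""
    for author_id in range(first_id, last_id + 1):
        html = fetch(author_url(base_url, author_id))
        name = extract_username(html) if html else None
        if name:
            yield author_id, name
```

To try it, iterate over enumerate_users("http://example.com", 1, 5) and write each name to the file returned by output_filename("http://example.com"). Keeping the URL building and title parsing in separate pure functions makes them easy to check without touching the network.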