xpath剔除某些不需要的资源/元素:
html.xpath("//h1[@name='hname' and not(contains(@class,'cname2'))]//text()")
//span[starts-with(@id,'nothread')]/following::*[1][name()='blockquote']/text()
// DIV [@类=“ipsType_normal ipsType_richText ipsContained “]/p [not(@ class =”ipsQuote“)]
html.xpath('/html/body/*[not(name()="form")]//text()')



![[图片] “草它妈”遭众多网友恶搞1楼 2008-09-16 15:41:50-微慑信息网-VulSee.com](http://img1.bbs.163.com/20080916/baoliao/gz/gz_fmks/40c2776edd3032f7029b1bc4b37dec49.jpg)


![Nodejs 调试踩坑 [webstorm]-微慑信息网-VulSee.com](http://vulsee.com/wp-content/uploads/2020/05/496.png)

![[八卦] 王婷婷—揭秘一个大三女生的性爱录像-微慑信息网-VulSee.com](http://free.86hy.com/crack/pic/1.jpg)
![[随笔]今天国际警察节-微慑信息网-VulSee.com](http://photo.sohu.com/20041017/Img222528326.jpg)

青云网
