采集原理---采集技术篇---XMLHTTP(2)

日期：2020-05-13 栏目：程序人生浏览：次

on error resume next '有错误时继续执行代码
Dim http '定义变量
'Set http=Server.createobject(XmlHttpCom) '申请对象
Set http=Server.createobject("Microsoft.XMLHTTP") '保险起见，写出一个服务器一般都支持的版本
Http.open "GET",url,False '打开对象用GET方式等待服务器响应
Http.Send() '发送
If Http.readystate<>4 Then '如果服务器没反应,则退出函数
Exit Function
End If

GetFileText=bytes2BSTR(Http.responseBody,"GB2312") '把得到的数据流二进制文件转化成文本字符格式 (GB2312)

Set http=Nothing '删除对象
If err.number<>0 Then err.Clear '如果有错误,清除错误
End Function

'// <summary>
'// 采用 ADODB.Stream　处理采集到的数据，把二进制的文件转成文本字符
'// </summary>
Function Bytes2bStr(vin,cSet)
Dim BytesStream,StringReturn
Set BytesStream = Server.createObject("ADODB.Stream")
BytesStream.Type = 2
BytesStream.Open
BytesStream.WriteText vin
BytesStream.Position = 0
BytesStream.CharSet = cSet
BytesStream.Position = 2
StringReturn =BytesStream.ReadText
BytesStream.close
Set BytesStream = Nothing
Bytes2bStr = StringReturn
End Function

下面我定义一个路径变量 URL

URL = "http://ent.sina.com.cn/star/mainland/more.html";

上面是一个网址,如果我们想把上面这个地址采集下来,并显示出来的话，我们可以这样操作

URL = "http://ent.sina.com.cn/star/mainland/more.html";

Response.Write GetFileText(URL)

这样就可以采集到上面网址的内容了
是不是很简单呢

那采集到数据之后应该怎么操作呢
怎么区分数据，如果得到你想要的数据，如果把得到的数据入库呢
这是以后需要分析讲解的问题了入库要注意的地方，用正表达式处理数据

附上上面代码的源文件，大家可以下载下去，运行起来试试，是不是真的能采集到数据库

转载注明出处：http://www.heiqu.com/3438.html

采集原理---采集技术篇---XMLHTTP(2)

相关推荐