A Method for Converting Relative Paths to Absolute Paths in PHP Using Regular Expressions

Date: 2020-06-28 Category: Programmer's Life

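Anyone who has written a web crawler knows that the hyperlinks it collects often need to be normalized into absolute paths. This article presents a regular-expression approach to processing those links. Without further ado, here are the details.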

A crawl typically turns up links like the following:

<a href=""></a>
<a href=" " > </a>
<a href="https://www.jb51.net/index.html" alt="超链接"> index.html </a>
<a href="https://www.jb51.net/" target="_blank"> / target="_blank" </a>
<a target="_blank" href="https://www.jb51.net/" alt="超链接" > target="_blank" / alt="超链接" </a>
<a target="_blank" title="超链接" href="https://www.jb51.net/" alt="超链接" > target="_blank" title="超链接" / alt="超链接" </a>
<a href="https://www.jb51.net/" > / </a>
<a href="https://www.jb51.net/article/a" > a </a>
<a href="/index.html?id=1" > /index.html?id=1 </a>
<a href="?id=2" > ?id=2 </a>
<a href="https://www.jb51.net/index.html" > //index.html </a>
<a href="https://www.mafutian.net" > //www.mafutian.net </a>
<a href="https://www.hole_1.com/index.html" > </a>
<a href="https://www.mafutian.net" > </a>
<a href="https://www.numberer.net" > </a>
<a href="https://www.jb51.net/article/1.jpg" > 1.jpg </a>
<a href="https://www.jb51.net/article/1.jpeg" > 1.jpeg </a>
<a href="https://www.jb51.net/article/1.gif" > 1.gif </a>
<a href="https://www.jb51.net/article/1.png" > 1.png </a>
<a href="https://www.jb51.net/article/1.txt" > 1.txt </a>
<a href="https://www.jb51.net/index.html" > index.html </a>
<a href="https://www.jb51.net/index.html" > index.html </a>
<a href="https://www.jb51.net/article/index.html" > ./index.html </a>
<a href="https://www.jb51.net/index.html" > ../index.html </a>
<a href="https://www.jb51.net/article/.../" > .../ </a>
<a href="https://www.jb51.net/article/..." > ... </a>
<a href="javascript:void(0)" > javascript:void(0) </a>
<a href="https://www.jb51.net/article/a:b" > a:b </a>
<a href="/a#a:b" > /a#a:b </a>
<a href="mailto:'mafutian@126.com'" > mailto:'mafutian@126.com' </a>
<a href="/tencent://message/?uin=335134463" > /tencent://message/?uin=335134463 </a>
<a href="" > . </a>
<a href="" > .. </a>
<a href="https://www.jb51.net/" > ../ </a>
<a href="https://www.jb51.net/a/b/.." > /a/b/.. </a>
<a href="https://www.jb51.net/a" > /a </a>
<a href="https://www.jb51.net/article/b" > ./b </a>
<a href="https://www.jb51.net/article/././././b" > ./././././././././b </a>
<a href="https://www.jb51.net/c" > ../c </a>
<a href="" > ../../d </a>
<a href="" > ../a/../b/c/../d </a>
<a href="https://www.jb51.net/e" > ./../e </a>
<a href="https://www.hole_1.org/../e" > </a>
<a href="https://www.jb51.net/./f" > ./.././f </a>
<a href="https://www.hole_1.org/../a/.../../b/c/../d/.." > </a>
<a href="https://www.jb51.net/:8081/index.html" > :8081/index.html </a>
<a href="https://www.mafutian.net:80/index.html" > :80/index.html </a>
<a href="https://www.mafutian.net:8081/index.html" > :8081/index.html </a>
<a href="https://www.mafutian.net:8082/index.html" > :8082/index.html </a>
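
Before any of these links can be processed, the href values have to be pulled out of the markup. The following is only a minimal sketch of one way to do that with a regular expression, not the article's original code: the function name extractHrefs is hypothetical, and the pattern assumes quoted attribute values. For messy real-world HTML a DOM parser is usually more robust.

```php
<?php
// Hypothetical helper (not from the original article): collect the raw href
// value of every <a> tag, whatever the attribute order.
function extractHrefs(string $html): array
{
    // <a ... href = "value" or 'value'; \1 matches the same quote that opened the value.
    preg_match_all('~<a\b[^>]*?\bhref\s*=\s*(["\'])(.*?)\1~is', $html, $m);
    return $m[2]; // capture group 2 holds the href values, in document order
}

// Usage: empty and whitespace-only values are returned too, so the caller can filter them.
$hrefs = extractHrefs('<a href=""></a> <a target="_blank" href="../c" alt="x"> ../c </a>');
```

The empty values are kept on purpose so that the decision about which links to skip stays in one place, the conversion step.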

Step one: convert each link to an absolute path:
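
As a rough illustration of this step, here is a minimal sketch under stated assumptions rather than the article's original implementation. The function name toAbsolute and the base URL https://www.example.com/article/page.html are hypothetical. The sketch skips empty links and javascript:, mailto:, tel: and data: links, treats anything without "://" as relative (so "a:b" counts as a relative path, as in the link list above), and deliberately leaves "." and ".." segments in place for a later cleanup step.

```php
<?php
// Hypothetical helper: resolve $href against $base without collapsing "." or ".." yet.
// Returns null for links a crawler would normally skip.
function toAbsolute(string $href, string $base): ?string
{
    $href = trim($href);
    // Empty links and non-navigational schemes are skipped.
    if ($href === '' || preg_match('~^(?:javascript|mailto|tel|data):~i', $href)) {
        return null;
    }
    // Already absolute (scheme://...): return unchanged.
    if (preg_match('~^[a-z][a-z0-9+.-]*://~i', $href)) {
        return $href;
    }
    // Split the base into scheme, authority (host[:port]), path and optional query.
    if (!preg_match('~^([a-z][a-z0-9+.-]*)://([^/?#]+)([^?#]*)(\?[^#]*)?~i', $base, $m)) {
        return null;
    }
    $root  = $m[1] . '://' . $m[2];
    $path  = $m[3] === '' ? '/' : $m[3];
    $query = $m[4] ?? '';

    if (strpos($href, '//') === 0) { return $m[1] . ':' . $href; }          // protocol-relative
    if ($href[0] === '/')          { return $root . $href; }                // root-relative
    if ($href[0] === '?')          { return $root . $path . $href; }        // query only
    if ($href[0] === '#')          { return $root . $path . $query . $href; } // fragment only

    // Plain relative path: replace everything after the last "/" of the base path.
    $dir = preg_replace('~[^/]*$~', '', $path);
    return $root . $dir . $href;
}

$base = 'https://www.example.com/article/page.html'; // assumed base URL
echo toAbsolute('index.html', $base), "\n"; // https://www.example.com/article/index.html
echo toAbsolute('../c', $base), "\n";       // https://www.example.com/article/../c
```

Joining first and normalizing afterwards keeps each regular expression small: this step only decides which part of the base URL the link inherits.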

... / ../ ../

Next, here is the code for removing './', '../', and '/..' from the absolute path:
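
One possible way to do this cleanup is sketched below; it is an illustration under assumptions, not the article's original code. The function name normalizeDots is hypothetical, the input is assumed to be an absolute URL of the form scheme://host/path, and a ".." that would climb above the site root is simply dropped. A segment such as "..." is treated as an ordinary directory name.

```php
<?php
// Hypothetical helper: collapse "." and ".." segments in the path of an absolute URL.
function normalizeDots(string $url): string
{
    // Split into scheme://authority, path, and the rest (query string and fragment).
    if (!preg_match('~^([a-z][a-z0-9+.-]*://[^/?#]+)([^?#]*)(.*)$~is', $url, $m)) {
        return $url; // not an absolute URL: leave untouched
    }
    [, $root, $path, $tail] = $m;

    // 1. Remove "." segments: "/./" and a trailing "/." both become "/".
    do {
        $path = preg_replace('~/\.(/|$)~', '/', $path, -1, $n);
    } while ($n > 0);

    // 2. Collapse "name/.." pairs; the lookahead stops ".." itself from being taken as the name.
    do {
        $path = preg_replace('~/(?!\.\.(?:/|$))[^/]+/\.\.(/|$)~', '/', $path, -1, $n);
    } while ($n > 0);

    // 3. Drop any ".." left at the start of the path (it would climb above the root).
    do {
        $path = preg_replace('~^/\.\.(/|$)~', '/', $path, 1, $n);
    } while ($n > 0);

    return $root . $path . $tail;
}

echo normalizeDots('https://www.example.com/article/././b'), "\n"; // https://www.example.com/article/b
echo normalizeDots('https://www.example.com/article/../c'), "\n";  // https://www.example.com/c
```

Each replacement can expose a new match (removing "b/.." from "/a/b/../.." leaves "/a/.."), which is why every rule runs in a loop until it stops matching.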

Summary

When reposting, please credit the source: https://www.heiqu.com/3d8951f563282af7ec7b79b5c8edc06a.html
