Help: matching certain <a> tags with a regular expression

lihui_life 2011-05-20 11:40:00
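I need a regular expression that matches <a> tags whose href starts with the domain name or with "/", and whose URL ends in .aspx.
For example: <a href="http://jiancai.xyecom.com/sell-info.aspx">test</a>
<a href="/sell-info.aspx">test</a>

Below is part of the HTML that the program retrieved: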

<li><a hre
f=\"#\" onclick=\"xy_SelectSearchMenu('品牌','brand','');\">品牌</a></li>\r\n\t\t\t\t\t <li><a href=\"#\" onclick=\"xy_SelectSearchMenu('展会','exhibition','');\">展会</a></li>\r\n\t\t\t\t\t</ul>\r\n\t\t\t\t</div>\r\n\t\t\t\t<input type=\"hidden\" id=\"xy_FlagName\" value=\"offer\"/>\r\n\t\t\t\t<input type=\"hidden\" id=\"xy_InfoType\" value=\"sell\"/>\r\n\t\t\t\t<button value=\"搜索\" id=\"DoSearch\" name=\"DoSearch\" class=\"btsearch\" onclick=\"xy_search();\"/>搜索</button>\r\n\t\t\t\t<a href=\"/search/advanced_search.aspx\" class=\"black\">[高级搜索]</a>\r\n\t\t\t\t<a href=\"http://jiancai.xyecom.com/contributor.aspx\" class=\"black\">[投稿]</a>\r\n\t\t\t\t<a href=\"http://jiancai.xyecom.com/post.aspx\" class=\"black\">[发布信息]</a>\r\n\t\t\t\t<a href=\"http://jiancai.xyecom.com/baike/index.aspx\" class=\"black\">[百科]</a>\r\n\t\t\t</div>\r\n\t\t\t<div id=\"sch_bnr\"><a href=\"\"><img src=\"http://jiancai.xyecom.com/templates/default/images/temp_pic_5.jpg\" width=\"170\" height=\"77\" alt=\"\" /></a></div>\r\n\t\t\t<div id=\
"hot_schs\">\r\n\t\t\t\t<ul>\r\n\t\t\t\t\t<li><strong>热门搜索词:</strong></li>\r\n\t\t\t\t\t<li>\r\n\t\t\t\t\t\t<ul>\r\n\t\t\t\t\t\t\t<li><a target=\"_blank\" href=\"/search/seller_search-offer--花花-------.aspx\">花花</a></li><li><a target=\"_blank\" href=\"/search/seller_search-offer--方法-------.aspx\">方法</a></li><li><a target=\"_blank\" href=\"/search/seller_search-offer--huahua-------.aspx\">huahua</a></li><li><a target=\"_blank\" href=\"/search/seller_search-offer--ggf-------.aspx\">ggf</a></li><li><a target=\"_blank\" href=\"/search/seller_search-offer--ggf,r6-------.aspx\">ggf,r6</a></li>\r\n\t\t\t\t\t\t</ul>\r\n\t\t\t\t\t</li>\r\n\t\t\t\t</ul>\r\n\t\t\t</div>\r\n\t\t</div>\r\n\t</div>\r\n <div id=\"best_cmpy\">\r\n <ul>\r\n <li><a href=\"\">\r\n <img src=\"http://jiancai.xyecom.com/templates/default/images/temp_pic_1.jpg\" width=\"191\"\r\n height=\"62\" alt=\"\" /></a></li>\r\n <li><a href=\"\">\r\n <im
g src=\"http://jiancai.xyecom.com/templates/default/images/temp_pic_2.jpg\" width=\"191\"\r\n height=\"62\" alt=\"\" /></a></li>\r\n <li><a href=\"\">\r\n <img src=\"http://jiancai.xyecom.com/templates/default/images/temp_pic_3.jpg\" width=\"191\"\r\n height=\"62\" alt=\"\" /></a></li>\r\n <li><a href=\"\">\r\n <img src=\"http://jiancai.xyecom.com/templates/default/images/temp_pic_4.jpg\" width=\"191\"\r\n height=\"62\" alt=\"\" /></a></li>\r\n <li><a href=\"\">\r\n <img src=\"http://jiancai.xyecom.com/templates/default/images/temp_pic_3.jpg\" width=\"191\"\r\n height=\"62\" alt=\"\" /></a></li>\r\n <li><a href=\"\">\r\n <img src=\"http://jiancai.xyecom.com/templates/default/images/temp_pic_4.jpg\" width=\"191\"\r\n height=\"62\" alt=\"\" /></a></li>\r\n
<li><a href=\"\">\r\n <img src=\"http://jiancai.xyecom.com/templates/default/images/temp_pic_1.jpg\" width=\"191\"\r\n height=\"62\" alt=\"\" /></a></li>\r\n <li><a href=\"\">\r\n <img src=\"http://jiancai.xyecom.com/templates/default/images/temp_pic_2.jpg\" width=\"191\"\r\n height=\"62\" alt=\"\" /></a></li>\r\n <li><a href=\"\">\r\n <img src=\"http://jiancai.xyecom.com/templates/default/images/temp_pic_1.jpg\" width=\"191\"\r\n height=\"62\" alt=\"\" /></a></li>\r\n <li><a href=\"\">\r\n <img src=\"http://jiancai.xyecom.com/templates/default/images/temp_pic_2.jpg\" width=\"191\"\r\n height=\"62\" alt=\"\" /></a></li>\r\n </ul>\r\n </div>\r\n <div id=\"idx_cnt1\">\r\n <div id=\"left\">\r\n <div class=\"top spacer\">\r\n </div>\r
\n <div class=\"main\">\r\n <div class=\"tit\">\r\n <h2>\r\n <span>热点信息</span></h2>\r\n </div>\r\n <ul class=\"hinfo-text\">\r\n <li><a href=\"http://jiancai.xyecom.com/news/channel-5.aspx\">[营销学院]</a> <a href=\"http://jiancai.xyecom.com/news/content-10.aspx\"><font style=color:#008000;font-weight:bold;font-style:italic;text-decoration:underline>test1323232</font></a></li><li><a href=\"http://jiancai.xyecom.com/news/channel-3.aspx\">[市场分析]</a> <a href=\"http://jiancai.xyecom.com/news/content-8.aspx\">测试测试测试</a></li>\r\n </ul>\r\n <ul class=\"hinfo-img\">\r\n <li><a href=\"#\">\r\n <img src=\"http://jiancai.xyecom.com/templates/default/images/temp_pic_6.jpg\" width=\"118\"\r\n height=\"42\" alt=\"\" /><span>沈阳市金山技术有限公司</span></a></li>\r\n
<li><a href=\"#\" class=\"orange\">\r\n <img src=\"http://jiancai.xyecom.com/templates/default/images/temp_pic_7.jpg\" width=\"118\"\r\n height=\"42\" alt=\"\" /><span>华北造纸生产基地</span></a></li>\r\n </ul>\r\n <div class=\"clr\">\r\n </div>\r\n </div>\r\n <div class=\"btm spacer\">\r\n </div>\r\n </div>\r\n <div id=\"mid\">\r\n <a href=\"\">\r\n <img src=\"http://jiancai.xyecom.com/templates/default/images/temp_bnr_2.jpg\" width=\"360\"\r\n height=\"194\" alt=\"\" /></a>\r\n </div>\r\n <div id=\"right\">\r\n\t<!--未登录显示-->\r\n\t<div class=\"B_8 r\" style=\"display:none;\" id=\"xy_login_nologin\">\r\n\t\t<div class=\"B_8\">\r\n\t\t\t<div class=\"B_9\">立即登录</div>\r\n\t\t\t<div class=\"B_10\"><a class=\"white\" href=\"http://jiancai.xyecom.com/user\"
>进入商务管理后台</a></div>\r\n\t\t</div>\r\n\t\t<div class=\"B_11_2 clr\">\r\n\t\t\t<div class=\"B_12\">\r\n\t\t\t\t<div class=\"B_12\">\r\n\t\t\t\t\t<div class=\"B_13\">用户名:</div>\r\n\t\t\t\t\t<div class=\"B_15\"><input type=\"text\" tabindex=\"1\" class=\"tf1\" name=\"_lbox_username\" id=\"_lbox_username\" onkeydown =\"_xy_KeyPress('_btnBoxLogin');\"/></div>\r\n\t\t\t\t</div>\r\n\t\t\t\t<div class=\"B_12\">\r\n\t\t\t\t\t<div class=\"B_13\">密 码:</div>\r\n\t\t\t\t\t<div class=\"B_15\"><input type=\"password\" tabindex=\"2\" class=\"tf1\" name=\"_lbox_password\" id=\"_lbox_password\" onkeydown =\"_xy_KeyPress('_btnBoxLogin');\"/></div>\r\n\t\t\t\t</div>\r\n\t\t\t\t<div class=\"B_16 clr\" style=\"padding-top:5px;\">\r\n\t\t\t\t <input id=\"_btnBoxLogin\" height=\"29\" width=\"108\" type=\"image\" src=\"http://jiancai.xyecom.com/templates/default/images/dabn6.gif\" onclick=\"return xy_BoxLogin();\"/> \r\n\t\t\t\t <a href=\"http://jiancai.xyecom.com/getpassword.aspx\">忘记密码?</a>\r\n\t\t\t\t</div>\r\n\t\t\t</div>\r
\n\t\t\t<div class=\"login_links\">\r\n\t\t\t\t<ul class=\"service\">\r\n\t\t\t\t\t<li class=\"s_1\"><a href=\"http://jiancai.xyecom.com/user/\"><strong>发布信息</strong></a></li>\r\n\t\t\t\t\t<li class=\"s_2\"><a href=\"http://jiancai.xyecom.com/register.aspx\"><strong>注册会员</strong></a></li>\r\n\t\t\t\t</ul>\r\n\t\t\t\t<div class=\"clr\"></div>\r\n\t\t\t</div>\r\n\t\t</div>\r\n\t</div>\r\n <!--登录后显示-->\r\n\t<div class=\"B_8 r\" id=\"xy_login_logined\">\r\n\t\t<div class=\"B_8\">\r\n\t\t\t<div class=\"B_9\">欢迎登录</div>\r\n\t\t\t<div class=\"B_10\"><a class=\"white\" href=\"\" id=\"_lbox_ucenter\">进入商务管理后台</a></div>\r\n\t\t</div>\r\n\t\t<div class=\"B_11 clr\">\r\n\t\t\t<div class=\"E_1\"><font class=\"huei14\">欢迎您!</font> <span id=\"_lbox_uname\"></span> [<a target=\"_self\" class=\"hui12i\" href=\"http://jiancai.xyecom.com/logout.aspx\">退出</a>]</div>\r\n\t\t\t<div class=\"B_12 clr\">\r\n\t\t\t\t<div class=\"E_6\"><a href=\"http://jiancai.xyecom.com/user/infoselect.aspx\">发布产品</a> | <a href=\"http://jiancai.xyec
om.com/user/addnews.aspx\">发布资讯</a> | <a href=\"http://jiancai.xyecom.com/user/receivemessagelist.aspx\">我的留言</a></div>\r\n\t\t\t\t<div style=\"padding-top:20px;padding-left: 37px;\" class=\"E_6 mt10\"><a href=\"/user/\
...
6 replies
q107770540 2011-05-23

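// Script-style snippet (for example LINQPad); in a regular project this needs
// using System; and using System.Text.RegularExpressions;
void Main()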
{
string str="<li><a href=\"#\" onclick=\"xy_SelectSearchMenu('品牌','brand','');\">品牌</a></li>\r\n\t\t\t\t\t <li><a href=\"#\" onclick=\"xy_SelectSearchMenu('展会','exhibition','');\">展会</a></li>\r\n\t\t\t\t\t</ul>\r\n\t\t\t\t</div>\r\n\t\t\t\t<input type=\"hidden\" id=\"xy_FlagName\" value=\"offer\"/>\r\n\t\t\t\t<input type=\"hidden\" id=\"xy_InfoType\" value=\"sell\"/>\r\n\t\t\t\t<button value=\"搜索\" id=\"DoSearch\" name=\"DoSearch\" class=\"btsearch\" onclick=\"xy_search();\"/>搜索</button>\r\n\t\t\t\t<a href=\"/search/advanced_search.aspx\" class=\"black\">[高级搜索]</a>\r\n\t\t\t\t<a href=\"http://jiancai.xyecom.com/contributor.aspx\" class=\"black\">[投稿]</a>\r\n\t\t\t\t<a href=\"http://jiancai.xyecom.com/post.aspx\" class=\"black\">[发布信息]</a>\r\n\t\t\t\t<a href=\"http://jiancai.xyecom.com/baike/index.aspx\" class=\"black\">[百科]</a>\r\n\t\t\t</div>\r\n\t\t\t<div id=\"sch_bnr\"><a href=\"\"><img src=\"http://jiancai.xyecom.com/templates/default/images/temp_pic_5.jpg\" width=\"170\" height=\"77\" alt=\"\" /></a></div>\r\n\t\t\t<div id=\"hot_schs\">\r\n\t\t\t\t<ul>\r\n\t\t\t\t\t<li><strong>热门搜索词:</strong></li>\r\n\t\t\t\t\t<li>\r\n\t\t\t\t\t\t<ul>\r\n\t\t\t\t\t\t\t<li><a target=\"_blank\" href=\"/search/seller_search-offer--花花-------.aspx\">花花</a></li><li><a target=\"_blank\" href=\"/search/seller_search-offer--方法-------.aspx\">方法</a></li><li><a target=\"_blank\" href=\"/search/seller_search-offer--huahua-------.aspx\">huahua</a></li><li><a target=\"_blank\" href=\"/search/seller_search-offer--ggf-------.aspx\">ggf</a></li><li><a target=\"_blank\" href=\"/search/seller_search-offer--ggf,r6-------.aspx\">ggf,r6</a></li>\r\n\t\t\t\t\t\t</ul>\r\n\t\t\t\t\t</li>\r\n\t\t\t\t</ul>\r\n\t\t\t</div>\r\n\t\t</div>\r\n\t</div>\r\n <div id=\"best_cmpy\">\r\n <ul>\r\n <li><a href=\"\">\r\n <img src=\"http://jiancai.xyecom.com/templates/default/images/temp_pic_1.jpg\" width=\"191\"\r\n height=\"62\" alt=\"\" /></a></li>\r\n <li><a href=\"\">\r\n <img 
src=\"http://jiancai.xyecom.com/templates/default/images/temp_pic_2.jpg\" width=\"191\"\r\n height=\"62\" alt=\"\" /></a></li>\r\n <li><a href=\"\">\r\n <img src=\"http://jiancai.xyecom.com/templates/default/images/temp_pic_3.jpg\" width=\"191\"\r\n height=\"62\" alt=\"\" /></a></li>\r\n <li><a href=\"\">\r\n <img src=\"http://jiancai.xyecom.com/templates/default/images/temp_pic_4.jpg\" width=\"191\"\r\n height=\"62\" alt=\"\" /></a></li>\r\n <li><a href=\"\">\r\n <img src=\"http://jiancai.xyecom.com/templates/default/images/temp_pic_3.jpg\" width=\"191\"\r\n height=\"62\" alt=\"\" /></a></li>\r\n <li><a href=\"\">\r\n <img src=\"http://jiancai.xyecom.com/templates/default/images/temp_pic_4.jpg\" width=\"191\"\r\n height=\"62\" alt=\"\" /></a></li>\r\n <li><a href=\"\">\r\n <img src=\"http://jiancai.xyecom.com/templates/default/images/temp_pic_1.jpg\" width=\"191\"\r\n height=\"62\" alt=\"\" /></a></li>\r\n <li><a href=\"\">\r\n <img src=\"http://jiancai.xyecom.com/templates/default/images/temp_pic_2.jpg\" width=\"191\"\r\n height=\"62\" alt=\"\" /></a></li>\r\n <li><a href=\"\">\r\n <img src=\"http://jiancai.xyecom.com/templates/default/images/temp_pic_1.jpg\" width=\"191\"\r\n height=\"62\" alt=\"\" /></a></li>\r\n <li><a href=\"\">\r\n <img src=\"http://jiancai.xyecom.com/templates/default/images/temp_pic_2.jpg\" width=\"191\"\r\n height=\"62\" alt=\"\" /></a></li>\r\n </ul>\r\n </div>\r\n <div id=\"idx_cnt1\">\r\n <div id=\"left\">\r\n <div class=\"top spacer\">\r\n </div>\r\n <div class=\"main\">\r\n <div class=\"tit\">\r\n <h2>\r\n <span>热点信息</span></h2>\r\n </div>\r\n <ul class=\"hinfo-text\">\r\n <li><a href=\"http://jiancai.xyecom.com/news/channel-5.aspx\">[营销学院]</a> <a href=\"http://jiancai.xyecom.com/news/content-10.aspx\"><font style=color:#008000;font-weight:bold;font-style:italic;text-decoration:underline>test1323232</font></a></li><li><a href=\"http://jiancai.xyecom.com/news/channel-3.aspx\">[市场分析]</a> <a 
href=\"http://jiancai.xyecom.com/news/content-8.aspx\">测试测试测试</a></li>\r\n </ul>\r\n <ul class=\"hinfo-img\">\r\n <li><a href=\"#\">\r\n <img src=\"http://jiancai.xyecom.com/templates/default/images/temp_pic_6.jpg\" width=\"118\"\r\n height=\"42\" alt=\"\" /><span>沈阳市金山技术有限公司</span></a></li>\r\n <li><a href=\"#\" class=\"orange\">\r\n <img src=\"http://jiancai.xyecom.com/templates/default/images/temp_pic_7.jpg\" width=\"118\"\r\n height=\"42\" alt=\"\" /><span>华北造纸生产基地</span></a></li>\r\n </ul>\r\n <div class=\"clr\">\r\n </div>\r\n </div>\r\n <div class=\"btm spacer\">\r\n </div>\r\n </div>\r\n <div id=\"mid\">\r\n <a href=\"\">\r\n <img src=\"http://jiancai.xyecom.com/templates/default/images/temp_bnr_2.jpg\" width=\"360\"\r\n height=\"194\" alt=\"\" /></a>\r\n </div>\r\n <div id=\"right\">\r\n\t<!--未登录显示-->\r\n\t<div class=\"B_8 r\" style=\"display:none;\" id=\"xy_login_nologin\">\r\n\t\t<div class=\"B_8\">\r\n\t\t\t<div class=\"B_9\">立即登录</div>\r\n\t\t\t<div class=\"B_10\"><a class=\"white\" href=\"http://jiancai.xyecom.com/user\">进入商务管理后台</a></div>\r\n\t\t</div>\r\n\t\t<div class=\"B_11_2 clr\">\r\n\t\t\t<div class=\"B_12\">\r\n\t\t\t\t<div class=\"B_12\">\r\n\t\t\t\t\t<div class=\"B_13\">用户名:</div>\r\n\t\t\t\t\t<div class=\"B_15\"><input type=\"text\" tabindex=\"1\" class=\"tf1\" name=\"_lbox_username\" id=\"_lbox_username\" onkeydown =\"_xy_KeyPress('_btnBoxLogin');\"/></div>\r\n\t\t\t\t</div>\r\n\t\t\t\t<div class=\"B_12\">\r\n\t\t\t\t\t<div class=\"B_13\">密 码:</div>\r\n\t\t\t\t\t<div class=\"B_15\"><input type=\"password\" tabindex=\"2\" class=\"tf1\" name=\"_lbox_password\" id=\"_lbox_password\" onkeydown =\"_xy_KeyPress('_btnBoxLogin');\"/></div>\r\n\t\t\t\t</div>\r\n\t\t\t\t<div class=\"B_16 clr\" style=\"padding-top:5px;\">\r\n\t\t\t\t <input id=\"_btnBoxLogin\" height=\"29\" width=\"108\" type=\"image\" src=\"http://jiancai.xyecom.com/templates/default/images/dabn6.gif\" onclick=\"return xy_BoxLogin();\"/> \r\n\t\t\t\t <a 
href=\"http://jiancai.xyecom.com/getpassword.aspx\">忘记密码?</a>\r\n\t\t\t\t</div>\r\n\t\t\t</div>\r\n\t\t\t<div class=\"login_links\">\r\n\t\t\t\t<ul class=\"service\">\r\n\t\t\t\t\t<li class=\"s_1\"><a href=\"http://jiancai.xyecom.com/user/\"><strong>发布信息</strong></a></li>\r\n\t\t\t\t\t<li class=\"s_2\"><a href=\"http://jiancai.xyecom.com/register.aspx\"><strong>注册会员</strong></a></li>\r\n\t\t\t\t</ul>\r\n\t\t\t\t<div class=\"clr\"></div>\r\n\t\t\t</div>\r\n\t\t</div>\r\n\t</div>\r\n <!--登录后显示-->\r\n\t<div class=\"B_8 r\" id=\"xy_login_logined\">\r\n\t\t<div class=\"B_8\">\r\n\t\t\t<div class=\"B_9\">欢迎登录</div>\r\n\t\t\t<div class=\"B_10\"><a class=\"white\" href=\"\" id=\"_lbox_ucenter\">进入商务管理后台</a></div>\r\n\t\t</div>\r\n\t\t<div class=\"B_11 clr\">\r\n\t\t\t<div class=\"E_1\"><font class=\"huei14\">欢迎您!</font> <span id=\"_lbox_uname\"></span> [<a target=\"_self\" class=\"hui12i\" href=\"http://jiancai.xyecom.com/logout.aspx\">退出</a>]</div>\r\n\t\t\t<div class=\"B_12 clr\">\r\n\t\t\t\t<div class=\"E_6\"><a href=\"http://jiancai.xyecom.com/user/infoselect.aspx\">发布产品</a> | <a href=\"http://jiancai.xyecom.com/user/addnews.aspx\">发布资讯</a> | <a href=\"http://jiancai.xyecom.com/user/receivemessagelist.aspx\">我的留言</a></div>\r\n\t\t\t\t<div style=\"padding-top:20px;padding-left: 37px;\" class=\"E_6 mt10\"><a href=\"/user/\"";
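// Keep only links on jiancai.xyecom.com, or root-relative links starting with "/",
// whose URL ends in .aspx or .html. The dots in the host name are escaped so they match literally.
foreach(Match m in Regex.Matches(str,@"(?i)<a[^>]*href=([""'])?(http://jiancai\.xyecom\.com)?/[^""']+\.(aspx|html)\1[^>]*>[^<>]*</a>"))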
{
Console.WriteLine(m.Value);
}

/*
<a href="/search/advanced_search.aspx" class="black">[高级搜索]</a>
<a href="http://jiancai.xyecom.com/contributor.aspx" class="black">[投稿]</a>
<a href="http://jiancai.xyecom.com/post.aspx" class="black">[发布信息]</a>
<a href="http://jiancai.xyecom.com/baike/index.aspx" class="black">[百科]</a>
<a target="_blank" href="/search/seller_search-offer--花花-------.aspx">花花</a>
<a target="_blank" href="/search/seller_search-offer--方法-------.aspx">方法</a>
<a target="_blank" href="/search/seller_search-offer--huahua-------.aspx">huahua</a>
<a target="_blank" href="/search/seller_search-offer--ggf-------.aspx">ggf</a>
<a target="_blank" href="/search/seller_search-offer--ggf,r6-------.aspx">ggf,r6</a>
<a href="http://jiancai.xyecom.com/news/channel-5.aspx">[营销学院]</a>
<a href="http://jiancai.xyecom.com/news/channel-3.aspx">[市场分析]</a>
<a href="http://jiancai.xyecom.com/news/content-8.aspx">测试测试测试</a>
<a href="http://jiancai.xyecom.com/getpassword.aspx">忘记密码?</a>
<a target="_self" class="hui12i" href="http://jiancai.xyecom.com/logout.aspx">退出</a>
<a href="http://jiancai.xyecom.com/user/infoselect.aspx">发布产品</a>
<a href="http://jiancai.xyecom.com/user/addnews.aspx">发布资讯</a>
<a href="http://jiancai.xyecom.com/user/receivemessagelist.aspx">我的留言</a>

*/
}


Next time, think the problem through before asking,
and describe it completely in one go.
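
One thing the output listed above shows: links whose text is wrapped in another tag, such as the content-10.aspx link (its text sits inside a <font> element) or register.aspx (inside <strong>), are not listed, because [^<>]*</a> only accepts plain text between the tags. What follows is a minimal sketch, not part of the original reply, of a variant that tolerates nested markup and exposes the URL through a named group. The names AnchorFinder, FindLinks, q, url and text are made up for illustration, and the sketch assumes href values are always quoted.

```csharp
using System;
using System.Collections.Generic;
using System.Text.RegularExpressions;

public static class AnchorFinder
{
    // Hypothetical helper, not from the thread.
    // q    : the quote character that opens the href value
    // url  : optional site prefix, then a path starting with "/" and ending in .aspx or .html
    // text : everything up to the nearest </a>, nested tags included
    private static readonly Regex LinkPattern = new Regex(
        @"<a\b[^>]*?\bhref=(?<q>[""'])" +
        @"(?<url>(?:http://jiancai\.xyecom\.com)?/[^""']+\.(?:aspx|html))" +
        @"\k<q>[^>]*>(?<text>.*?)</a>",
        RegexOptions.IgnoreCase | RegexOptions.Singleline);

    public static List<string> FindLinks(string html)
    {
        var urls = new List<string>();
        foreach (Match m in LinkPattern.Matches(html))
        {
            urls.Add(m.Groups["url"].Value);
        }
        return urls;
    }
}

public static class Program
{
    public static void Main()
    {
        string html =
            "<a href=\"#\">skip</a>" +
            "<a href=\"http://jiancai.xyecom.com/news/content-10.aspx\"><font>test</font></a>" +
            "<a href=\"/sell-info.aspx\">test</a>" +
            "<a href=\"http://example.com/other.aspx\">external</a>";

        // Prints the content-10.aspx URL and /sell-info.aspx; the "#" and example.com links are skipped.
        foreach (string url in AnchorFinder.FindLinks(html))
        {
            Console.WriteLine(url);
        }
    }
}
```

RegexOptions.Singleline lets the lazy .*? run across line breaks, which matters for anchors in the sample whose <img> child starts on the next line. This is still pattern matching rather than HTML parsing, so unusual markup (for example an </a> inside a comment) can fool it.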
lihui_life 2011-05-20

Many thanks for the answer in reply #1. I'll test it first, haha. My reply above was posted by mistake.
[Quote=Quoting reply #1 by q107770540:]
C# code

void Main()
{
string str="<li><a href=\"#\" onclick=\"xy_SelectSearchMenu('品牌','brand','');\">品牌</a></li>\r\n\t\t\t\t\t <li><a href=\"#\" onclick=\"xy_SelectSearchMenu('展会','exhibitio……
[/Quote]
lihui_life 2011-05-20
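Many thanks for the answer in reply #1. I'll test it first, haha.
[Quote=Quoting the original post by lihuioooo:]
I need a regular expression that matches <a> tags whose href starts with the domain name or with "/", and whose URL ends in .aspx.
For example: <a href="http://jiancai.xyecom.com/sell-info.aspx">test</a>
<a href="/sell-info.aspx">test</a>

Below is part of the HTML that the program retrieved: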

<li><a hre
f=\"#\" oncl……
[/Quote]
laowang134 2011-05-20
Not here to farm points, just marking this thread to study later.
q107770540 2011-05-20

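// Script-style snippet (for example LINQPad); in a regular project this needs
// using System; and using System.Text.RegularExpressions;
void Main()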
{
string str="<li><a href=\"#\" onclick=\"xy_SelectSearchMenu('品牌','brand','');\">品牌</a></li>\r\n\t\t\t\t\t <li><a href=\"#\" onclick=\"xy_SelectSearchMenu('展会','exhibition','');\">展会</a></li>\r\n\t\t\t\t\t</ul>\r\n\t\t\t\t</div>\r\n\t\t\t\t<input type=\"hidden\" id=\"xy_FlagName\" value=\"offer\"/>\r\n\t\t\t\t<input type=\"hidden\" id=\"xy_InfoType\" value=\"sell\"/>\r\n\t\t\t\t<button value=\"搜索\" id=\"DoSearch\" name=\"DoSearch\" class=\"btsearch\" onclick=\"xy_search();\"/>搜索</button>\r\n\t\t\t\t<a href=\"/search/advanced_search.aspx\" class=\"black\">[高级搜索]</a>\r\n\t\t\t\t<a href=\"http://jiancai.xyecom.com/contributor.aspx\" class=\"black\">[投稿]</a>\r\n\t\t\t\t<a href=\"http://jiancai.xyecom.com/post.aspx\" class=\"black\">[发布信息]</a>\r\n\t\t\t\t<a href=\"http://jiancai.xyecom.com/baike/index.aspx\" class=\"black\">[百科]</a>\r\n\t\t\t</div>\r\n\t\t\t<div id=\"sch_bnr\"><a href=\"\"><img src=\"http://jiancai.xyecom.com/templates/default/images/temp_pic_5.jpg\" width=\"170\" height=\"77\" alt=\"\" /></a></div>\r\n\t\t\t<div id=\"hot_schs\">\r\n\t\t\t\t<ul>\r\n\t\t\t\t\t<li><strong>热门搜索词:</strong></li>\r\n\t\t\t\t\t<li>\r\n\t\t\t\t\t\t<ul>\r\n\t\t\t\t\t\t\t<li><a target=\"_blank\" href=\"/search/seller_search-offer--花花-------.aspx\">花花</a></li><li><a target=\"_blank\" href=\"/search/seller_search-offer--方法-------.aspx\">方法</a></li><li><a target=\"_blank\" href=\"/search/seller_search-offer--huahua-------.aspx\">huahua</a></li><li><a target=\"_blank\" href=\"/search/seller_search-offer--ggf-------.aspx\">ggf</a></li><li><a target=\"_blank\" href=\"/search/seller_search-offer--ggf,r6-------.aspx\">ggf,r6</a></li>\r\n\t\t\t\t\t\t</ul>\r\n\t\t\t\t\t</li>\r\n\t\t\t\t</ul>\r\n\t\t\t</div>\r\n\t\t</div>\r\n\t</div>\r\n <div id=\"best_cmpy\">\r\n <ul>\r\n <li><a href=\"\">\r\n <img src=\"http://jiancai.xyecom.com/templates/default/images/temp_pic_1.jpg\" width=\"191\"\r\n height=\"62\" alt=\"\" /></a></li>\r\n <li><a href=\"\">\r\n <img 
src=\"http://jiancai.xyecom.com/templates/default/images/temp_pic_2.jpg\" width=\"191\"\r\n height=\"62\" alt=\"\" /></a></li>\r\n <li><a href=\"\">\r\n <img src=\"http://jiancai.xyecom.com/templates/default/images/temp_pic_3.jpg\" width=\"191\"\r\n height=\"62\" alt=\"\" /></a></li>\r\n <li><a href=\"\">\r\n <img src=\"http://jiancai.xyecom.com/templates/default/images/temp_pic_4.jpg\" width=\"191\"\r\n height=\"62\" alt=\"\" /></a></li>\r\n <li><a href=\"\">\r\n <img src=\"http://jiancai.xyecom.com/templates/default/images/temp_pic_3.jpg\" width=\"191\"\r\n height=\"62\" alt=\"\" /></a></li>\r\n <li><a href=\"\">\r\n <img src=\"http://jiancai.xyecom.com/templates/default/images/temp_pic_4.jpg\" width=\"191\"\r\n height=\"62\" alt=\"\" /></a></li>\r\n <li><a href=\"\">\r\n <img src=\"http://jiancai.xyecom.com/templates/default/images/temp_pic_1.jpg\" width=\"191\"\r\n height=\"62\" alt=\"\" /></a></li>\r\n <li><a href=\"\">\r\n <img src=\"http://jiancai.xyecom.com/templates/default/images/temp_pic_2.jpg\" width=\"191\"\r\n height=\"62\" alt=\"\" /></a></li>\r\n <li><a href=\"\">\r\n <img src=\"http://jiancai.xyecom.com/templates/default/images/temp_pic_1.jpg\" width=\"191\"\r\n height=\"62\" alt=\"\" /></a></li>\r\n <li><a href=\"\">\r\n <img src=\"http://jiancai.xyecom.com/templates/default/images/temp_pic_2.jpg\" width=\"191\"\r\n height=\"62\" alt=\"\" /></a></li>\r\n </ul>\r\n </div>\r\n <div id=\"idx_cnt1\">\r\n <div id=\"left\">\r\n <div class=\"top spacer\">\r\n </div>\r\n <div class=\"main\">\r\n <div class=\"tit\">\r\n <h2>\r\n <span>热点信息</span></h2>\r\n </div>\r\n <ul class=\"hinfo-text\">\r\n <li><a href=\"http://jiancai.xyecom.com/news/channel-5.aspx\">[营销学院]</a> <a href=\"http://jiancai.xyecom.com/news/content-10.aspx\"><font style=color:#008000;font-weight:bold;font-style:italic;text-decoration:underline>test1323232</font></a></li><li><a href=\"http://jiancai.xyecom.com/news/channel-3.aspx\">[市场分析]</a> <a 
href=\"http://jiancai.xyecom.com/news/content-8.aspx\">测试测试测试</a></li>\r\n </ul>\r\n <ul class=\"hinfo-img\">\r\n <li><a href=\"#\">\r\n <img src=\"http://jiancai.xyecom.com/templates/default/images/temp_pic_6.jpg\" width=\"118\"\r\n height=\"42\" alt=\"\" /><span>沈阳市金山技术有限公司</span></a></li>\r\n <li><a href=\"#\" class=\"orange\">\r\n <img src=\"http://jiancai.xyecom.com/templates/default/images/temp_pic_7.jpg\" width=\"118\"\r\n height=\"42\" alt=\"\" /><span>华北造纸生产基地</span></a></li>\r\n </ul>\r\n <div class=\"clr\">\r\n </div>\r\n </div>\r\n <div class=\"btm spacer\">\r\n </div>\r\n </div>\r\n <div id=\"mid\">\r\n <a href=\"\">\r\n <img src=\"http://jiancai.xyecom.com/templates/default/images/temp_bnr_2.jpg\" width=\"360\"\r\n height=\"194\" alt=\"\" /></a>\r\n </div>\r\n <div id=\"right\">\r\n\t<!--未登录显示-->\r\n\t<div class=\"B_8 r\" style=\"display:none;\" id=\"xy_login_nologin\">\r\n\t\t<div class=\"B_8\">\r\n\t\t\t<div class=\"B_9\">立即登录</div>\r\n\t\t\t<div class=\"B_10\"><a class=\"white\" href=\"http://jiancai.xyecom.com/user\">进入商务管理后台</a></div>\r\n\t\t</div>\r\n\t\t<div class=\"B_11_2 clr\">\r\n\t\t\t<div class=\"B_12\">\r\n\t\t\t\t<div class=\"B_12\">\r\n\t\t\t\t\t<div class=\"B_13\">用户名:</div>\r\n\t\t\t\t\t<div class=\"B_15\"><input type=\"text\" tabindex=\"1\" class=\"tf1\" name=\"_lbox_username\" id=\"_lbox_username\" onkeydown =\"_xy_KeyPress('_btnBoxLogin');\"/></div>\r\n\t\t\t\t</div>\r\n\t\t\t\t<div class=\"B_12\">\r\n\t\t\t\t\t<div class=\"B_13\">密 码:</div>\r\n\t\t\t\t\t<div class=\"B_15\"><input type=\"password\" tabindex=\"2\" class=\"tf1\" name=\"_lbox_password\" id=\"_lbox_password\" onkeydown =\"_xy_KeyPress('_btnBoxLogin');\"/></div>\r\n\t\t\t\t</div>\r\n\t\t\t\t<div class=\"B_16 clr\" style=\"padding-top:5px;\">\r\n\t\t\t\t <input id=\"_btnBoxLogin\" height=\"29\" width=\"108\" type=\"image\" src=\"http://jiancai.xyecom.com/templates/default/images/dabn6.gif\" onclick=\"return xy_BoxLogin();\"/> \r\n\t\t\t\t <a 
href=\"http://jiancai.xyecom.com/getpassword.aspx\">忘记密码?</a>\r\n\t\t\t\t</div>\r\n\t\t\t</div>\r\n\t\t\t<div class=\"login_links\">\r\n\t\t\t\t<ul class=\"service\">\r\n\t\t\t\t\t<li class=\"s_1\"><a href=\"http://jiancai.xyecom.com/user/\"><strong>发布信息</strong></a></li>\r\n\t\t\t\t\t<li class=\"s_2\"><a href=\"http://jiancai.xyecom.com/register.aspx\"><strong>注册会员</strong></a></li>\r\n\t\t\t\t</ul>\r\n\t\t\t\t<div class=\"clr\"></div>\r\n\t\t\t</div>\r\n\t\t</div>\r\n\t</div>\r\n <!--登录后显示-->\r\n\t<div class=\"B_8 r\" id=\"xy_login_logined\">\r\n\t\t<div class=\"B_8\">\r\n\t\t\t<div class=\"B_9\">欢迎登录</div>\r\n\t\t\t<div class=\"B_10\"><a class=\"white\" href=\"\" id=\"_lbox_ucenter\">进入商务管理后台</a></div>\r\n\t\t</div>\r\n\t\t<div class=\"B_11 clr\">\r\n\t\t\t<div class=\"E_1\"><font class=\"huei14\">欢迎您!</font> <span id=\"_lbox_uname\"></span> [<a target=\"_self\" class=\"hui12i\" href=\"http://jiancai.xyecom.com/logout.aspx\">退出</a>]</div>\r\n\t\t\t<div class=\"B_12 clr\">\r\n\t\t\t\t<div class=\"E_6\"><a href=\"http://jiancai.xyecom.com/user/infoselect.aspx\">发布产品</a> | <a href=\"http://jiancai.xyecom.com/user/addnews.aspx\">发布资讯</a> | <a href=\"http://jiancai.xyecom.com/user/receivemessagelist.aspx\">我的留言</a></div>\r\n\t\t\t\t<div style=\"padding-top:20px;padding-left: 37px;\" class=\"E_6 mt10\"><a href=\"/user/\"";
// Match <a> tags whose quoted href ends in .aspx and whose link text contains no nested tags.
foreach(Match m in Regex.Matches(str,@"(?i)<a[^>]*href=([""'])?[^""']+\.aspx\1[^>]*>[^<>]*</a>"))
{
Console.WriteLine(m.Value);
}
/*
<a href="/search/advanced_search.aspx" class="black">[高级搜索]</a>
<a href="http://jiancai.xyecom.com/contributor.aspx" class="black">[投稿]</a>
<a href="http://jiancai.xyecom.com/post.aspx" class="black">[发布信息]</a>
<a href="http://jiancai.xyecom.com/baike/index.aspx" class="black">[百科]</a>
<a target="_blank" href="/search/seller_search-offer--花花-------.aspx">花花</a>
<a target="_blank" href="/search/seller_search-offer--方法-------.aspx">方法</a>
<a target="_blank" href="/search/seller_search-offer--huahua-------.aspx">huahua</a>
<a target="_blank" href="/search/seller_search-offer--ggf-------.aspx">ggf</a>
<a target="_blank" href="/search/seller_search-offer--ggf,r6-------.aspx">ggf,r6</a>
<a href="http://jiancai.xyecom.com/news/channel-5.aspx">[营销学院]</a>
<a href="http://jiancai.xyecom.com/news/channel-3.aspx">[市场分析]</a>
<a href="http://jiancai.xyecom.com/news/content-8.aspx">测试测试测试</a>
<a href="http://jiancai.xyecom.com/getpassword.aspx">忘记密码?</a>
<a target="_self" class="hui12i" href="http://jiancai.xyecom.com/logout.aspx">退出</a>
<a href="http://jiancai.xyecom.com/user/infoselect.aspx">发布产品</a>
<a href="http://jiancai.xyecom.com/user/addnews.aspx">发布资讯</a>
<a href="http://jiancai.xyecom.com/user/receivemessagelist.aspx">我的留言</a>
*/
}
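
The pattern in the reply above is dense, so here is the same expression spread out with RegexOptions.IgnorePatternWhitespace and one comment per piece. This is only an annotated sketch; the names AnnotatedPattern, AspxAnchor and Find are made up for illustration. One detail worth knowing: as far as I know, in .NET a backreference to a group that captured nothing does not match, so the ? after group 1 does not actually let unquoted href values through.

```csharp
using System;
using System.Text.RegularExpressions;

public static class AnnotatedPattern
{
    // Same pieces as the one-line pattern in the reply, one per line.
    // Inside a verbatim string a double quote is written as "".
    public const string AspxAnchor = @"
        <a[^>]*href=      # opening tag up to the href attribute
        ([""'])?          # group 1: the quote character that opens the value
        [^""']+           # the URL body: anything except quote characters
        \.aspx            # the URL has to end in a literal .aspx
        \1                # the same quote character closes the value
        [^>]*>            # remaining attributes, then the end of the opening tag
        [^<>]*            # link text without nested tags
        </a>              # closing tag
    ";

    public static MatchCollection Find(string html)
    {
        // IgnoreCase replaces the inline (?i) used in the original pattern.
        return Regex.Matches(html, AspxAnchor,
            RegexOptions.IgnoreCase | RegexOptions.IgnorePatternWhitespace);
    }
}

public static class Program
{
    public static void Main()
    {
        string html =
            "<a href=\"/sell-info.aspx\">test</a>" +
            "<a href=\"#\">placeholder</a>" +
            "<a href=\"/page.html\">html page</a>";

        // Only the first anchor ends in .aspx, so only it is printed.
        foreach (Match m in AnnotatedPattern.Find(html))
        {
            Console.WriteLine(m.Value);
        }
    }
}
```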

lihui_life 2011-05-20
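What if I want to filter out URLs from other sites and keep only URLs under the http://jiancai.xyecom.com domain, or URLs that start with "/", where the URL suffix may be either html or aspx? How should the pattern be written then?
Sorry, my regex skills are really poor.
[Quote=Quoting reply #1 by q107770540:]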
C# code

void Main()
{
string str="<li><a href=\"#\" onclick=\"xy_SelectSearchMenu('品牌','brand','');\">品牌</a></li>\r\n\t\t\t\t\t <li><a href=\"#\" onclick=\"xy_SelectSearchMenu('展会','exhibitio……
[/Quote]
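
The filtering asked about here can also be split into two steps: pull out every quoted href with a simple pattern, then decide in ordinary code which ones to keep. The sketch below is one possible way to do that and is not taken from any reply in this thread. SiteLinkFilter, Filter and AllowedHost are hypothetical names; only the host name and the two suffixes come from the question. It assumes href values are quoted and checks the host only, not the scheme.

```csharp
using System;
using System.Collections.Generic;
using System.Text.RegularExpressions;

public static class SiteLinkFilter
{
    // Host taken from the question; everything else here is illustrative.
    private const string AllowedHost = "jiancai.xyecom.com";

    private static readonly Regex Href = new Regex(
        @"<a\b[^>]*?\bhref=(?<q>[""'])(?<url>[^""']*)\k<q>",
        RegexOptions.IgnoreCase);

    public static List<string> Filter(string html)
    {
        var kept = new List<string>();
        foreach (Match m in Href.Matches(html))
        {
            string raw = m.Groups["url"].Value;
            string path;

            if (raw.StartsWith("/", StringComparison.Ordinal) &&
                !raw.StartsWith("//", StringComparison.Ordinal))
            {
                // Root-relative link: it points at the current site by definition.
                path = raw;
            }
            else
            {
                // Anything else has to parse as an absolute URL on the allowed host.
                Uri abs;
                if (!Uri.TryCreate(raw, UriKind.Absolute, out abs)) continue;
                if (!string.Equals(abs.Host, AllowedHost, StringComparison.OrdinalIgnoreCase)) continue;
                path = abs.AbsolutePath;
            }

            // Ignore any query string or fragment when checking the suffix.
            int cut = path.IndexOfAny(new[] { '?', '#' });
            if (cut >= 0) path = path.Substring(0, cut);

            if (path.EndsWith(".aspx", StringComparison.OrdinalIgnoreCase) ||
                path.EndsWith(".html", StringComparison.OrdinalIgnoreCase))
            {
                kept.Add(raw);
            }
        }
        return kept;
    }
}

public static class Program
{
    public static void Main()
    {
        string html =
            "<a href=\"/search/advanced_search.aspx\">a</a>" +
            "<a href=\"http://jiancai.xyecom.com/index.html\">b</a>" +
            "<a href=\"http://example.com/x.aspx\">c</a>" +
            "<a href=\"#\">d</a>";

        // Keeps the first two links; the example.com link and "#" are dropped.
        foreach (string url in SiteLinkFilter.Filter(html))
        {
            Console.WriteLine(url);
        }
    }
}
```

Compared with putting the host into the pattern, this keeps the regex short and also rejects protocol-relative links such as //other-site/page.aspx, which begin with "/" but point elsewhere.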
