Observe:

Python 2.7.3 (default, Sep 4 2012, 20:19:03)
[GCC 4.2.1 20070831 patched [FreeBSD]] on freebsd9
Type "help", "copyright", "credits" or "license" for more information.
>>> junk=''' <a href=""http://a0c5e.site.it/r"" target=_blank><font color=#808080>MailUp</font></a>
... <a href=""http://www.site.it/prodottiLLPP.php?id=1"" class=""txtBlueGeorgia16"">Prodotti</a>
... <a href=""http://www.site.it/terremoto.php"" target=""blank"" class=""txtGrigioScuroGeorgia12"">Terremoto</a>
... <a class="mini" href="http://www.site.com/remove/professionisti.aspx?Id=65&Code=xhmyskwzse">clicca qui.</a>`'''
>>> import re
>>> pat=re.compile(r'''http[\:/a-zA-Z0-9\.\?\=&]*''')
>>> pat.findall(junk)
['http://a0c5e.site.it/r', 'http://www.site.it/prodottiLLPP.php?id=1', 'http://www.site.it/terremoto.php', 'http://www.site.com/remove/professionisti.aspx?Id=65&Code=xhmyskwzse']

…
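The session above can also be wrapped in a small script. What follows is only a minimal sketch, assuming Python 3: the names `URL_PATTERN`, `extract_urls` and `sample` are hypothetical and not from the original session, and `sample` is a shortened copy of the `junk` string. The character class is the same one used above, written without the backslashes, which are not needed inside a bracket expression.

```python
import re

# Same character class as in the session: after "http", match any run of
# colons, slashes, ASCII letters, digits, dots, '?', '=' and '&'.
URL_PATTERN = re.compile(r"http[:/a-zA-Z0-9.?=&]*")


def extract_urls(text):
    """Return every substring of `text` that matches URL_PATTERN, in order."""
    return URL_PATTERN.findall(text)


# Hypothetical sample input, shortened from the session's `junk` string.
sample = '''
<a href=""http://a0c5e.site.it/r"" target=_blank>MailUp</a>
<a class="mini" href="http://www.site.com/remove/professionisti.aspx?Id=65&Code=xhmyskwzse">clicca qui.</a>
'''

urls = extract_urls(sample)
print(urls)
```

Note that the match stops at the first character outside the class, so a URL containing a hyphen, underscore or percent sign would be cut short: `extract_urls("http://my-site.example/a_b")` gives `['http://my']`. That is fine for the links shown here, but the class would probably need extending for general use.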