I have a list of news sites
[ 'ria.ru', 'www.rbc.ru', 'lenta.ru', 'news.rambler.ru', 'kp.ru', 'iz.ru', 'www.gazeta.ru', 'vesti.ru', 'www.mk.ru', 'news.ngs.ru', 'russian.rt.com', 'life.ru', 'ren.tv', 'smi2.ru', 'kommersant.ru', 'svpressa.ru', 'tass.ru', 'cosmo.ru', 'lentainform.com', 'ura.ru', 'echo.msk.ru', 'vz.ru', 'www.aif.ru', 'dni.ru', 'www.ridus.ru', 'E1.RU', 'ridus.ru', 'rg.ru', 'tsargrad.tv', 'eg.ru', ] Of these, I need to take only links to articles. I use find_all ("a"), and skip through the loop. But the problem is that I have to filter the real links to the article from others. Any ideas how to implement this?