Can't parse a URL with Cyrillic characters
    from bs4 import BeautifulSoup
    from urllib import request

    html_doc = request.urlopen('http://кто.рф/').read()
    soup = BeautifulSoup(html_doc)
    title = soup.title.string
    print(title)

I constantly see the same error:

    UnicodeEncodeError: 'latin-1' codec can't encode characters in position 0-2: ordinal not in range(256)

Calling encode or decode on the URL does not help.
Python 3.4, BeautifulSoup 4.3.2.
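For reference, here is a minimal sketch of one possible workaround. It assumes the error comes from the non-ASCII host name being written into the HTTP request as latin-1, which the traceback suggests but which I have not confirmed. The helper name `to_ascii_url` is hypothetical. It converts the host with Python's built-in `idna` codec and percent-encodes the path, so the URL handed to `urlopen` is plain ASCII.

```python
from urllib.parse import urlsplit, urlunsplit, quote


def to_ascii_url(url):
    """Return url with an IDNA-encoded host and a percent-encoded path.

    Hypothetical helper: user info (user:password@) is dropped, and the
    query string and fragment are passed through unchanged.
    """
    parts = urlsplit(url)
    # Convert the host name to its ASCII "xn--" form with the built-in
    # "idna" codec.
    host = parts.hostname.encode("idna").decode("ascii")
    if parts.port is not None:
        host = "{}:{}".format(host, parts.port)
    # Percent-encode non-ASCII path characters as UTF-8; keep "/" and any
    # "%" escapes that are already present.
    path = quote(parts.path, safe="/%")
    return urlunsplit((parts.scheme, host, path, parts.query, parts.fragment))
```

With this helper, the call in the question would become `request.urlopen(to_ascii_url('http://кто.рф/'))`, and the rest of the script would stay the same.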