For example, I have this string:
desc = u"привет 123123123 🙆🏼🙆🏼🙆🏼 тут какой то текст 12349! abcde 123"
I found a partial solution:
re.sub(r'[^\x00-\x7F]+', ' ', desc)
or
"".join(filter(lambda x: ord(x) < 128, desc.decode('utf-8')))
The problem is that both also delete all the Cyrillic characters, so the result is:
123123123 12349! abcde 123
The string may also contain m², which is a special character too, and I would like to keep it.
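One possible direction, as a rough sketch rather than a complete answer: instead of keeping only ASCII, remove only the code point ranges where emoji live, so Cyrillic and characters such as ² are left alone. The sketch assumes Python 3 (where str is Unicode and no decode() is needed). The name strip_emoji and the choice of ranges are my own assumptions; the ranges cover the common emoji blocks, skin tone modifiers, the zero-width joiner and the variation selector, but they are not an exhaustive definition of "emoji".

```python
import re

# Assumed emoji ranges (not exhaustive):
#   U+1F000-U+1FAFF  pictographs, emoticons, transport, skin tone modifiers
#   U+2600-U+27BF    miscellaneous symbols and dingbats
#   U+200D, U+FE0F   zero-width joiner and variation selector-16
EMOJI_PATTERN = re.compile(
    "[\U0001F000-\U0001FAFF\u2600-\u27BF\u200D\uFE0F]+"
)


def strip_emoji(text):
    """Remove characters in the assumed emoji ranges and tidy whitespace."""
    without_emoji = EMOJI_PATTERN.sub("", text)
    # Collapse the double space left behind where the emoji used to be.
    return " ".join(without_emoji.split())


desc = "привет 123123123 🙆🏼🙆🏼🙆🏼 тут какой то текст 12349! abcde 123 m²"
cleaned = strip_emoji(desc)
print(cleaned)
```

Because the pattern lists what to remove rather than what to keep, any other script or symbol outside those ranges (Cyrillic, ², accented Latin letters) passes through unchanged. If some emoji still slip through, the ranges would need to be extended.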