Sort by pandas condition

Question

I have a table with cities (the pandas.DataFrame object with a single 'city' column) table and the pandas.Series - cities object, which is the first column of this table ( cities = table['city'] ).

I need to sort the cities in such a way that only the words that satisfy this condition are left in it. In this case, I need to leave only cities in cities beginning with the letter 'м' . I need something similar to the action described below, but only with the help of pandas, not lists:

 lst = [word for word in cities if word[0] == 'м']

How can I accomplish this? I tried it, but it gives me a KeyError: False

 lst = table.loc[cities[0] == 'м']

Although, if you need to leave only the fields with the specified value, it works:

 lst= table.loc[cities != 'магадан']

How is it possible in the pandas.Series object to leave only the fields starting with a given letter?

MaxU MaxU 52.3k 6 18 51 · Accepted Answer · 2018-11-17T21:15:06

Source DataFrame:

 In [250]: table Out[250]: city 0 магадан 1 Киев 2 Мариуполь 3 Запорожье 4 Москва

Solution options:

 In [252]: table[table['city'].str.lower().str.startswith('м')] Out[252]: city 0 магадан 2 Мариуполь 4 Москва

or using regular expressions:

 In [253]: table[table['city'].str.contains('^м', flags=re.I)] Out[253]: city 0 магадан 2 Мариуполь 4 Москва In [254]: table[table['city'].str.match('^м.*$', flags=re.I)] Out[254]: city 0 магадан 2 Мариуполь 4 Москва

Sort by pandas condition

1 answer 1

More articles: