pandas Archives - Page 2 of 10

[Solved] Scraped CSV pandas dataframe I get: ValueError(‘Length of values does not match length of ‘ ‘index’)

January 14, 2023 by Kirat

You need merge with inner join: print(‘####CURRIES###’) df1 = pd.read_csv(‘C:\\O\\df1.csv’, index_col=False, usecols=[0,1,2], names=[“EW”, “WE”, “DA”], header=None) print(df1.head()) ####CURRIES### EW WE \ 0 can v can 1.90 1 Lanus U20 v Argentinos Jrs U20 2.10 2 Botafogo RJ U20 v Toluca U20 1.83 3 Atletico Mineiro U20 v Bahia U20 2.10 4 FC Porto v Monaco … Read more

[Solved] Is there any pandas function to merge 3 rows?

January 13, 2023 by Kirat

Let’s take the following sample DataFrame, containing 2 groups of 3 adjacent rows: C1 C2 C3 C4 C5 C6 ABC NaN NaN NaN NaN PK KJ PQR NaN NaN RR SS NaN NaN MNO PO UI NaN NaN NaN NaN XXX AA NaN NaN NaN EE NaN XX1 NaN BB NaN DD NaN FF1 XX2 … Read more

[Solved] How do I make a function in python which takes a list of integers as an input and outputs smaller lists with only two values?

January 11, 2023 by Kirat

If you only want groups of two (as opposed to groups of n), then you can hardcode n=2 and use a list comprehension to return a list of lists. This will also create a group of one at the end of the list if the length of the list is odd: some_list = [‘a’,’b’,’c’,’d’,’e’] [some_list[i:i+2] … Read more

[Solved] How do I filter an empty DataFrame and still keep the columns of that DataFrame?

January 8, 2023 by Kirat

df2 = df1[df1.B.apply(lambda x:x == 1).astype(bool)] All other answers are missing the point (except for Wen’s, which is an ok alternative) 1 solved How do I filter an empty DataFrame and still keep the columns of that DataFrame?

[Solved] How to store missing date(15 min interval) points from csv into new file (15 minutes interval) -python 3.5

January 6, 2023 by Kirat

try this: In [16]: df.ix[df.groupby(df[‘datetime’].dt.date)[‘production’].transform(‘nunique’) < 44 * 4 * 24, ‘datetime’].dt.date.unique() Out[16]: array([datetime.date(2015, 12, 7)], dtype=object) this will give you all rows for the “problematic” days: df[df.groupby(df[‘datetime’].dt.date)[‘production’].transform(‘nunique’) < 44 * 4 * 24] PS there is a good reason why people asked you for a good reproducible sample data sets – with the one … Read more

[Solved] find number of 1 and 0 combinations in two columns

January 6, 2023 by Kirat

Assuming you have a pandas dataframe, one option is to use pandas.crosstab to return another dataframe: import pandas as pd df = pd.read_csv(‘file.csv’) res = pd.crosstab(df[‘X’], df[‘Y’]) print(res) Y 0 1 X 0 3 7 1 1 3 A collections.Counter solution is also possible if a dictionary result is required: res = Counter(zip(df[‘X’].values, df[‘Y’].values)) 4 … Read more

[Solved] how to match string with dataframe in python [closed]

January 5, 2023 by Kirat

Perhaps you mean sth like this: words=” “.join(listvalue).upper().split() idx = df.value.str.upper().isin(words) In: df[idx] Out: variable value 48 Income salary 81 Shopping clothing 13 solved how to match string with dataframe in python [closed]

[Solved] Matplotlib graph adjusment with big dataset [closed]

January 3, 2023 by Kirat

Given this dataframe: df.head() complete mid_c mid_h mid_l mid_o time 0 True 0.80936 0.80943 0.80936 0.80943 2018-01-31 09:54:10+00:00 1 True 0.80942 0.80942 0.80937 0.80937 2018-01-31 09:54:20+00:00 2 True 0.80946 0.80946 0.80946 0.80946 2018-01-31 09:54:25+00:00 3 True 0.80942 0.80942 0.80940 0.80940 2018-01-31 09:54:30+00:00 4 True 0.80944 0.80944 0.80944 0.80944 2018-01-31 09:54:35+00:00 Create a 50 moving average: … Read more

[Solved] Sort the order of dataframe columns based on the values in the bottom row

December 31, 2022 by Kirat

You were close. Try this: import pandas as pd df = pd.DataFrame({‘a’: [1, 2, 3], ‘b’: [ 4, 5, 2], ‘c’: [2, 4, 5]}) print(df) df = df[[x for _, x in sorted(zip(df.iloc[-1], df.columns), reverse=True)]] print(df) Starting DataFrame: a b c 0 1 4 2 1 2 5 4 2 3 2 5 Columns sorted … Read more

[Solved] Calculate Year on Year, Quarter on Quarter, Month on month number of Repeated, new, lost customers & theri revenue using pandas/python

December 28, 2022 by Kirat

Calculate Year on Year, Quarter on Quarter, Month on month number of Repeated, new, lost customers & theri revenue using pandas/python solved Calculate Year on Year, Quarter on Quarter, Month on month number of Repeated, new, lost customers & theri revenue using pandas/python

[Solved] How to loop over an array of variables with the same form of name? [closed]

December 22, 2022 by Kirat

If you want to do all the columns in your dataframe: for col in df.columns: sns.countplot(df[col]) . Otherwise, following the pattern in your question: for i in range(1,11): column=’id_’+”{0}”.format(i).zfill(2) sns.countplot(df[column]) that will go through the numbers 1-10 and set column to the correct column name. zfill will make sure that for single digit numbers, column … Read more

[Solved] Add a new column with the list of values from all rows meeting a criterion

December 15, 2022 by Kirat

Something like this should work… df = pd.DataFrame({‘date’: [‘2017-01-01 01:01:01’, ‘2017-01-02 01:01:01’, ‘2017-01-03 01:01:01’, ‘2017-01-30 01:01:01’, ‘2017-01-31 01:01:01’], ‘value’: [99,98,97,95,94]}) df[‘date’] = pd.to_datetime(df[‘date’]) def get_list(row): subset = df[(row[‘date’] – df[‘date’] <= pd.to_timedelta(‘5 days’)) & (row[‘date’] – df[‘date’] >= pd.to_timedelta(‘0 days’))] return str(subset[‘value’].tolist()) df[‘list’] = df.apply(get_list, axis=1) Output: date value list 0 2017-01-01 01:01:01 99 [99] … Read more

[Solved] Python – How to Compare a column value of one row with value in next row

December 10, 2022 by Kirat

Use groupby (http://pandas.pydata.org/pandas-docs/stable/generated/pandas.DataFrame.groupby.html) Assume your input is saved in a pandas Dataframe (or equivalently save it into csv and read it using pandas.read_csv). Now you can loop over the groups with same S.No values with the following: output = {} for key, group in df.groupby(‘S.No.’): # print key # print group output[key] = {} output[key][‘Details’] … Read more