pandas Archives - Page 6 of 10

[Solved] Compare two dataframes and update the dataframe if the data is different [closed]

October 13, 2022 by Kirat

If I understand the logic correctly . . . # imports import pandas as pd from io import StringIO # sample data s1 = “””id Name score 111 Jack 2.17 112 Nick 1.11 113 Zoe 4.12″”” s2 = “””id Name score 111 Jack 2.17 112 Sick 1.10 113 Zoe 4.12 114 Jay 12.3″”” df1 = … Read more

[Solved] How to display a row based on specific columns and values

October 13, 2022 by Kirat

You can select within a column by: df.loc[(df[‘Killed’] > 7) & (df.index == ‘Dog’)] 3 solved How to display a row based on specific columns and values

[Solved] Aggregate symmetric pairs pandas

October 11, 2022 by Kirat

I used to have the same problem before , And this is my solution df1=df[[‘X’,’Y’]].apply(sorted,1) df.groupby([df1.X,df1.Y])[‘count’].sum().reset_index(name=”count”) Out[400]: X Y count 0 A B 3 1 C D 8 solved Aggregate symmetric pairs pandas

[Solved] Compare values under multiple conditions of one column in Python

October 10, 2022 by Kirat

Try: #Use pd.Categorical to ensure sorting if column is not lexicographical ordered. df[‘type’] = pd.Categorical(df[‘type’], ordered=True, categories=[‘s1′,’s2′,’s3’]) df[‘result’] = df.sort_values(‘type’).groupby(‘name’)[‘value’].diff(-1) df[‘result’] = df[‘result’].lt(0).mask(df[‘result’].isna(),”) df Output: index name type value result 0 1 A s1 20 False 1 2 A s2 10 2 3 B s1 18 True 3 4 B s2 32 False 4 5 … Read more

[Solved] Replacing specific columns with a text [closed]

October 10, 2022 by Kirat

I am not going to give the whole code as it is not that difficult to write. Just write is as you described in your question, just specify the parameter how (viz pandas.DataFrame.merge). The default is inner which causes the lost rows as it merge only on rows that exists in both dataframes. From your … Read more

[Solved] Replace every value in a pandas dataframe series [duplicate]

October 10, 2022 by Kirat

Your pattern matches multiple positions. One before a character (including a character) and one right after. You can test it here. If you include a start string ancor it will work to match anything (even empty strings) and replace with Test ^.* solved Replace every value in a pandas dataframe series [duplicate]

[Solved] Another Traceback Error When I Run My Python Code

October 9, 2022 by Kirat

You just have to many brackets ((df[‘Location’].str.contains(‘- Display’) & df[‘Lancaster’] == ” & df[‘Dakota’] == ‘D’ & df[‘Spitfire’] == ‘SS’ & df[‘Hurricane’] == ”)) You needed to remove a ‘)’ after each (‘- Display’) it looks like you will still have some problems with sorting through your data. But this should get you past your … Read more

[Solved] how to encode only categorical data in a dataframe

October 9, 2022 by Kirat

# Using standard scikit-learn label encoder. from sklearn.preprocessing import LabelEncoder le = LabelEncoder() # Encode all string columns. Assuming all categoricals are of type str. for c in df.select_dtypes([‘object’]): print “Encoding column ” + c df[c] = le.fit_transform(df[c]) 3 solved how to encode only categorical data in a dataframe

[Solved] Customize axes in Matplotlib

October 8, 2022 by Kirat

You can display subscripts by writing your column names using LaTex: import pandas as pd import matplotlib.pyplot as plt df = pd.DataFrame( { 0: { “Method 1”: 31.7, “Method 2”: 44.2, “Method 3”: 75.6, “Method 4”: 87.5, “Method 5”: 88.6, “Method 6”: 100.0, }, 1: { “Method 1”: 32.9, “Method 2”: 45.4, “Method 3”: 72.2, … Read more

[Solved] Pandas: get json from data frame

October 8, 2022 by Kirat

You can use: #convert int xolum to string df[‘member_id’] = df.member_id.astype(str) #reshaping and convert to months period df.set_index(‘member_id’, inplace=True) df = df.unstack().reset_index(name=”val”).rename(columns={‘level_0′:’date’}) df[‘date’] = pd.to_datetime(df.date).dt.to_period(‘m’).dt.strftime(‘%Y-%m’) #groupby by date and member_id and aggregate sum df = df.groupby([‘date’,’member_id’])[‘val’].sum() #convert all values !=0 to 1 df = (df != 0).astype(int).reset_index() #working in pandas 0.18.1 d = df.groupby(‘member_id’)[‘date’, ‘val’].apply(lambda … Read more

[Solved] Installing python packages offline [closed]

October 7, 2022 by Kirat

go and download packages from pypi. After that transport packages to your offline pc. open cmd and use this command. “pip install [Your packages path]”. 0 solved Installing python packages offline [closed]

[Solved] Using yield in nested loop

October 6, 2022 by Kirat

As you’ve been told in the comments, I also don’t think you can save memory using yield in this case. However, if you only want to know how to use yield, this is one of the options: import pandas as pd data = [{ “id”: 123, “sports”: { “football”: { “amount”: 3, “count”: 54 }, … Read more

[Solved] How to separate the contents of parentheses and make a new dataframe column? [closed]

October 6, 2022 by Kirat

Seems like str.extract would work assuming the seat number is the numeric characters before 席 and the seat arrangement is the values inside the parenthesis: import numpy as np import pandas as pd df = pd.DataFrame({ ‘seat’: [’45席（1階カウンター4席、６〜８人テーブル１席2階地下それぞれ最大20人）’, np.nan, np.nan, np.nan, ‘9席（カウンター9席、個室4席）’] }) new_df = df[‘seat’].str.extract(r'(\d+)席（(.*)）’, expand=True) new_df.columns = [‘seat number’, ‘seat arrangement’] new_df: seat … Read more

[Solved] FREQUENCY BAR CHART OF A DATE COLUMN IN AN ASCENDING ORDER OF DATES

October 6, 2022 by Kirat

import pandas as pd import matplotlib.pyplot as plt data = pd.read_csv(‘dataset.csv’) data[‘sample_date’] = pd.to_datetime(data[‘sample_date’]) data[‘sample_date’].value_counts().sort_index().plot(kind=’bar’) # Use sort_index() plt.tight_layout() plt.show() 0 solved FREQUENCY BAR CHART OF A DATE COLUMN IN AN ASCENDING ORDER OF DATES

[Solved] python – how do i assign columns to my dataframe?

October 6, 2022 by Kirat

import pandas as pd varnames = [‘Student_id’,’First_Name’,’Last_Name’,’Grade’] values = [[‘156841′,’Mark’,’Smith’,’85’], [‘785496′,’Jason’,’Gross’,’90’], [‘785612′,’Laura’,’Clarkson’,’76’], [‘125465′,’Tria’,’Carr’,’100′]] data1 = pd.DataFrame(values, columns=varnames) data1 solved python – how do i assign columns to my dataframe?