Import pandas, read in the Ecommerce Purchases csv file, and set it to a DataFrame called df.
import pandas as pd
df = pd.read_csv('C:/Users/ka030/Documents/GitHub/python_analysis/sources/Day4/workbook/Ecommerce Purchases.csv')
Check the head of the DataFrame.
df.head()
How many rows and columns are there?
df.info()
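df.info() answers this, but the row and column counts are buried in a longer summary. As a minimal sketch, the shape attribute returns just the two numbers. The toy frame below is a hypothetical stand-in for the real file, and its values are made up for illustration.

```python
import pandas as pd

# Hypothetical toy data standing in for the Ecommerce Purchases file.
toy = pd.DataFrame({
    "Purchase Price": [98.14, 70.73, 0.95],
    "Language": ["el", "fr", "en"],
})

# shape is a (rows, columns) tuple, so it can be unpacked directly.
n_rows, n_cols = toy.shape
print(n_rows, n_cols)
```

On the real data the same idea would be df.shape.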
What is the average Purchase Price?
df['Purchase Price'].mean()
What were the highest and lowest purchase prices?
df['Purchase Price'].max()
df['Purchase Price'].min()
How many people have English 'en' as their Language of choice on the website?
df[df['Language']=='en'].count()
How many people have the job title of "Lawyer"?
df[df['Job']=='Lawyer'].count()
# df[df['Job']=='Lawyer'].value_counts()
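Note that count() on a filtered DataFrame returns one number per column (the non-null entries in each), so the answer has to be read off one of the columns. A sketch of an alternative, assuming a single number is wanted: sum the boolean mask itself. The toy frame and its values are hypothetical.

```python
import pandas as pd

# Hypothetical toy data; the missing Job value shows why count() can vary per column.
toy = pd.DataFrame({
    "Language": ["en", "fr", "en", "de"],
    "Job": ["Lawyer", "Dentist", None, "Lawyer"],
})

# count() on the filtered frame: one non-null count per column.
per_column = toy[toy["Language"] == "en"].count()

# Summing the mask counts the True values and gives a single number.
en_count = int((toy["Language"] == "en").sum())
lawyer_count = int((toy["Job"] == "Lawyer").sum())
print(en_count, lawyer_count)
```

Here per_column differs between Language and Job because one of the matching rows has no Job, while the mask sum always counts matching rows.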
How many people made the purchase during the AM and how many during the PM?
(Hint: Check out value_counts())
df['AM or PM'].value_counts()
What are the 5 most common Job Titles?
df['Job'].value_counts().head()
Someone made a purchase that came from Lot "90 WT". What was the Purchase Price for this transaction?
df[df['Lot']=="90 WT"]['Purchase Price']
What is the email of the person with the following Credit Card Number: 4926535242672853?
df[df['Credit Card']==4926535242672853]['Email']
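The two lookups above filter the rows first and then pick a column. A sketch of the same lookup written as a single .loc call, which selects rows and column in one step. The toy rows below are hypothetical and do not come from the real file.

```python
import pandas as pd

# Hypothetical toy data; lots and prices are made up for illustration.
toy = pd.DataFrame({
    "Lot": ["46 in", "90 WT", "28 rn"],
    "Purchase Price": [98.14, 75.1, 0.95],
})

# .loc[row_mask, column] returns a Series of the matching values.
price = toy.loc[toy["Lot"] == "90 WT", "Purchase Price"]

# iloc[0] pulls out the bare value when exactly one row matches.
print(price.iloc[0])
```

The result is still a Series, so iloc[0] is only safe when at least one row matches.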
How many people have American Express as their Credit Card Provider and made a purchase above $95?
df[(df['Purchase Price']>95) & (df['CC Provider']=="American Express")].count()
Hard: How many people have a credit card that expires in 2025?
df['CC Exp Date']
def cc_expire(x):
    # Dates look like 'MM/YY'; months stop at 12, so '25' can only match the year.
    return '25' in x.split('/')
# cc_expire('02/25')
sum(df['CC Exp Date'].apply(cc_expire))
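The same count can be sketched without a helper function by using the vectorized .str accessor. This assumes every date has the 'MM/YY' form suggested by the '02/25' test value above. The sample dates are hypothetical.

```python
import pandas as pd

# Hypothetical expiry dates in the assumed 'MM/YY' format.
exp = pd.Series(["02/20", "11/25", "08/25", "03/22"], name="CC Exp Date")

# Split on '/' and keep the second piece (the two-digit year).
year = exp.str.split("/").str[1]

# Comparing to '25' gives a boolean Series; summing it counts the matches.
n_2025 = int((year == "25").sum())
print(n_2025)
```

On the real data the equivalent would start from df['CC Exp Date'].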
Hard: What are the top 5 most popular email providers/hosts (e.g. gmail.com, yahoo.com, etc.)?
df['Email']
def e_provider(x):
    # Return the part after the '@', e.g. 'gmail.com'.
    return x.split('@')[1]
# Assign the result directly; appending to a global list grows on every re-run.
df['email_provider'] = df['Email'].apply(e_provider)
df.head()
df['email_provider'].value_counts().head()
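As a sketch, the host can also be extracted with the .str accessor and counted in one chain, without adding a column. The addresses below are hypothetical, and the code assumes every address contains exactly one '@'.

```python
import pandas as pd

# Hypothetical email addresses for illustration only.
emails = pd.Series([
    "a@gmail.com", "b@yahoo.com", "c@gmail.com", "d@hotmail.com",
], name="Email")

# Split on '@', keep the host part, then count and take the top 5.
hosts = emails.str.split("@").str[1]
top = hosts.value_counts().head(5)
print(top)
```

On the real data this would be df['Email'].str.split('@').str[1].value_counts().head().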
Great Job!