python教程网

当前位置：首页 » 数据分析 » 正文

Python Pandas 对列/行进行选择，增加，删除操作

看: 1975次时间：2020-07-08 分类 : 数据分析

一、列操作

1.1 选择列

d = {'one' : pd.Series([1, 2, 3], index=['a', 'b', 'c']),
  'two' : pd.Series([1, 2, 3, 4], index=['a', 'b', 'c', 'd'])}

df = pd.DataFrame(d)
print (df ['one'])
# 选择其中一列进行显示，列长度为最长列的长度
# 除了 index 和 数据，还会显示 列表头名，和 数据 类型

运行结果：

a    1.0
b    2.0
c    3.0
d    NaN
Name: one, dtype: float64

1.2 增加列

d = {'one' : pd.Series([1, 2, 3], index=['a', 'b', 'c']),
  'two' : pd.Series([1, 2, 3, 4], index=['a', 'b', 'c', 'd'])}

df = pd.DataFrame(d)

# Adding a new column to an existing DataFrame object with column label by passing new series

print ("Adding a new column by passing as Series:")
df['three']=pd.Series([10,30,20],index=['a','c','b'])
print(df)
# 增加列后进行显示，其中 index 用于对应到该列 元素 位置（所以位置可以不由 列表 中的顺序进行指定）

print ("Adding a new column using the existing columns in DataFrame:")
df['four']=df['one']+df['two']+df['three']
print(df)
# 我们选定列后，直接可以对整个列的元素进行批量运算操作，这里 NaN 与其他元素相加后，还是 NaN

运行结果：

Adding a new column by passing as Series:
   one two three
a 1.0    1   10.0
b 2.0    2   20.0
c 3.0    3   30.0
d NaN    4    NaN
Adding a new column using the existing columns in DataFrame:
   one two three four
a 1.0    1   10.0 12.0
b 2.0    2   20.0 24.0
c 3.0    3   30.0 36.0
d NaN    4    NaN   NaN

1.3 删除列（del 和 pop 函数）

d = {'one' : pd.Series([1, 2, 3], index=['a', 'b', 'c']),
  'two' : pd.Series([1, 2, 3, 4], index=['a', 'b', 'c', 'd']),
  'three' : pd.Series([10,20,30], index=['a','b','c'])}

df = pd.DataFrame(d)
print ("Our dataframe is:")
print(df)

# 使用 del 函数
print ("Deleting the first column using DEL function:")
del(df['one'])
print(df)

# 使用 pop 函数
print ("Deleting another column using POP function:")
df_2=df.pop('two') # 将一列 pop 到新的 dataframe
print(df_2)
print(df)

运行结果：

Our dataframe is:
   one two three
a 1.0    1   10.0
b 2.0    2   20.0
c 3.0    3   30.0
d NaN    4    NaN
Deleting the first column using DEL function:
   two three
a    1   10.0
b    2   20.0
c    3   30.0
d    4    NaN
Deleting another column using POP function:
   three
a   10.0
b   20.0
c   30.0
d    NaN
POP column:
a    1
b    2
c    3
d    4
Name: two, dtype: int64

二、行操作

2.1 选择行

2.1.1 通过 label 选择行（loc 函数）

d = {'one' : pd.Series([1, 2, 3], index=['a', 'b', 'c']),
  'two' : pd.Series([1, 2, 3, 4], index=['a', 'b', 'c', 'd'])}

df = pd.DataFrame(d)
print(df.loc['b']) # 显示这一行中，对应表头 下的 对应数据，同时显示 行 index 和 数据类型```

<p>运行结果：</p>
<blockquote>
<p>one    2.0<br>
two    2.0<br>
Name: b, dtype: float64<br>
</p>
</blockquote>
<p>2.1.2 通过序号选择行（iloc 函数）<br>
</p>

```python  
d = {'one' : pd.Series([1, 2, 3], index=['a', 'b', 'c']),
  'two' : pd.Series([1, 2, 3, 4], index=['a', 'b', 'c', 'd'])}

df = pd.DataFrame(d)
print(df.iloc[2]) # 序号 2 对应的是第 3 行的数据

运行结果：

one 3.0
two 3.0
Name: c, dtype: float64

2.1.3 通过序号选择行切片

d = {'one' : pd.Series([1, 2, 3], index=['a', 'b', 'c']),
  'two' : pd.Series([1, 2, 3, 4], index=['a', 'b', 'c', 'd'])}

df = pd.DataFrame(d)
print(df[2:4]) # 这里选择第 3 到 第 4 行，与 Python 切片一致，不需要函数，直接切片即可

运行结果：

   one two
c 3.0    3
d NaN    4

2.2 增加行（append 函数）

# 通过 append 函数
df = pd.DataFrame([[1, 2], [3, 4]], columns = ['a','b'])
df2 = pd.DataFrame([[5, 6], [7, 8]], columns = ['a','b'])

df = df.append(df2)
print(df) # 这里相当于把 第二个 dataframe 与第一个进行拼接，默认的 index 都是 0 1
print(df.loc[0]) # 这里有两行的 index 是 0

运行结果：

a b
0 1 2
1 3 4
0 5 6
1 7 8
a b
0 1 2
0 5 6

2.3 删除行（drop 函数）

# 通过 drop 函数
df = pd.DataFrame([[1, 2], [3, 4]], columns = ['a','b'])
df2 = pd.DataFrame([[5, 6], [7, 8]], columns = ['a','b'])

df = df.append(df2)

df = df.drop(0) # 这里有两个行标签为 0，所以直接删除了 2 行
print(df)

运行结果：

a b
1 3 4
1 7 8

到此这篇关于Python Pandas 对列/行进行选择，增加，删除操作的文章就介绍到这了,更多相关Python Pandas行列选择增加删除内容请搜索python博客以前的文章或继续浏览下面的相关文章希望大家以后多多支持python博客！

2021-12-20python数据挖掘使用Evidently创建机器学习模型仪表板
2021-12-20Python多进程共享numpy 数组的方法
2021-12-20python数据分析近年比特币价格涨幅趋势分布
2021-12-20python调用matlab的方法详解
2021-12-20python学习与数据挖掘应知应会的十大终端命令
2021-07-20pandas中NaN缺失值的处理方法
2021-07-20Python数据分析入门之数据读取与存储
2021-07-20Python 如何读取字典的所有键-值对
2021-07-20如何获取numpy的第一个非0元素索引
2021-07-20Python机器学习之KNN近邻算法

搜索

推荐资源

python教程网

当前位置：首页 » 数据分析 » 正文

Python Pandas 对列/行进行选择，增加，删除操作

看: 1975次 时间：2020-07-08 分类 : 数据分析

相关文章

搜索

推荐资源

看: 1975次时间：2020-07-08 分类 : 数据分析