当前位置：首页 > news >正文

【数据分析】python-pandas速查文档（3）

news 2026/5/28 18:59:30

python-pandas速查文档（3）

博客主页：源码速查
本文档共 4 部分，当前为第 3 部分

文章目录

python-pandas速查文档（3）
- 六、公开方法速查（按功能分类）
- - - equals()：对象相等判断
    - eval()：表达式计算
    - factorize()：因子编码
    - floor()：时间向下取整
    - floordiv()：整除运算
    - flush()：刷新到磁盘
    - format()：格式化输出
    - fromordinal()：序数转时间戳
    - fromtimestamp()：秒数转时间戳
    - ge()：大于等于比较
    - gt()：大于比较
    - holds_integer()：是否存整数值
    - identical()：严格相同判断
    - indexer_at_time()：特定时间定位
    - infer_freq()：推断频率
    - infer_objects()：推断对象类型
    - intersection()：交集
    - is_boolean()：布尔类型判断
    - is_categorical()：分类类型判断
    - is_dtype()：类型匹配判断
    - is_floating()：浮点类型判断
    - is_integer()：整数类型判断
    - is_numeric()：数值类型判断
    - is_object()：对象类型判断
    - isetitem()：按位置设置值
    - isna()：缺失值检测
    - isnull()：缺失值检测（同义）
    - isocalendar()：ISO日历信息
    - isoweekday()：ISO星期序号
    - items()：逐列迭代
    - iterrows()：逐行迭代
    - itertuples()：命名元组迭代
    - keys()：HDF5键列表
    - kurt()：峰度系数
    - kurtosis()：峰度（同义）
    - le()：小于等于比较
    - list()：列表访问器入口
    - lreshape()：宽表转长表
    - lt()：小于比较
    - mod()：取余运算
    - mode()：众数
    - month_name()：月份名称
    - mul()：乘法
    - multiply()：乘法（同义）
    - ne()：不等于比较
    - nlargest()：最大的N个值
    - notna()：非缺失判断
    - notnull()：非缺失（同义）
    - now()：当前时间戳
    - nsmallest()：最小的N个值
    - nunique()：唯一值数量
    - parse()：解析Excel工作表
    - pop()：移除并返回列
    - pow()：幂运算
    - qcut()：分位数分箱
    - radd()：反向加法
    - ravel()：展平为一维数组
    - rdiv()：反向除法
- 八、快速索引
- - 方法快速索引
- 九、参考资料

【pandas 公开成员统计】：共 41 个公开类，628 个公开方法，239 个公开属性

六、公开方法速查（按功能分类）

本部分继续列出公开方法详细条目。

equals()：对象相等判断

Index.equals(self, other: 'object') -> 'bool'

归属：pandas.Index

说明：判断两个 Index 对象是否完全相等（包括值、名称、dtype）。

importpandasaspd idx1=pd.Index(["北京","上海","广州"],name="城市")idx2=pd.Index(["北京","上海","广州"],name="城市")idx3=pd.Index(["北京","上海","深圳"],name="城市")print("相同:",idx1.equals(idx2),"不同:",idx1.equals(idx3))# 输出: True False

eval()：表达式计算

DataFrame.eval(self, expr: 'str', *, inplace: 'bool' = False, **kwargs) -> 'Any | None'

归属：pandas.DataFrame

说明：在 DataFrame 列上执行字符串表达式计算，比手动写列运算更简洁且支持多核加速。

importpandasaspd 商品表=pd.DataFrame({"单价":[100,200,50],"数量":[3,1,10]})商品表.eval("金额 = 单价 * 数量",inplace=True)print(商品表)# 输出:# 单价 数量 金额# 0 100 3 300# 1 200 1 200# 2 50 10 500

归属：pandas

说明：pandas 顶层 eval，支持对局部或全局变量的表达式求值。

importpandasaspd df=pd.DataFrame({"A":[1,2,3],"B":[4,5,6]})result=pd.eval("df.A + df.B")print(result.tolist())# 输出: [5, 7, 9]

factorize()：因子编码

pd.factorize(values, sort: 'bool' = False, use_na_sentinel: 'bool' = True, size_hint: 'int | None' = None) -> 'tuple[np.ndarray, np.ndarray | Index]'

归属：pandas

说明：将字符串/分类值编码为整数标签和唯一值映射，适合替代 sklearn 的 LabelEncoder。

importpandasaspd 城市列表=["北京","上海","北京","广州","上海"]编码,唯一值=pd.factorize(城市列表)print("整数编码:",编码)# 输出: [0, 1, 0, 2, 1]print("唯一值:",唯一值.tolist())# 输出: ['北京', '上海', '广州']

floor()：时间向下取整

DatetimeIndex.floor(self, *args, **kwargs)

归属：pandas.DatetimeIndex

说明：将 DatetimeIndex 中的时间向下对齐到指定频率（小时/天/周等）。

importpandasaspd 日期=pd.DatetimeIndex(["2026-05-28 15:45:30","2026-05-29 23:10:00"])print("以小时取整:",日期.floor("h").tolist())# 输出: [Timestamp('2026-05-28 15:00'), Timestamp('2026-05-29 23:00')]

归属：pandas.Timedelta

说明：将 Timedelta 向下取整到指定精度。

importpandasaspd td=pd.Timedelta("3 hours 45 minutes")print(td.floor("h"))# 输出: 0 days 03:00:00

归属：pandas.Timestamp

说明：将时间戳向下取整到指定频率。

importpandasaspd ts=pd.Timestamp("2026-05-28 15:45:30")print(ts.floor("h"))# 输出: 2026-05-28 15:00:00

floordiv()：整除运算

DataFrame.floordiv(self, other, axis: 'Axis' = 'columns', level=None, fill_value=None) -> 'DataFrame'

归属：pandas.DataFrame

说明：对 DataFrame 逐元素执行整除//运算。

importpandasaspd df=pd.DataFrame({"总量":[100,250,333],"每组人数":[3,5,10]})df["组数"]=df["总量"].floordiv(df["每组人数"])print(df)# 输出:# 总量 每组人数 组数# 0 100 3 33# 1 250 5 50# 2 333 10 33

flush()：刷新到磁盘

HDFStore.flush(self, fsync: 'bool' = False) -> 'None'

归属：pandas.HDFStore

说明：将 HDF5 存储中的缓冲区数据强制写入磁盘，适合断电或异常安全场景。

frompathlibimportPathimportpandasaspd 路径=Path("测试.h5")store=pd.HDFStore(路径,mode="w")store["data"]=pd.DataFrame({"x":[1,2]})store.flush(fsync=True)store.close()路径.unlink(missing_ok=True)

format()：格式化输出

Index.format(self, name: 'bool' = False, formatter: 'Callable | None' = None, na_rep: 'str_t' = 'NaN') -> 'list[str_t]'

归属：pandas.Index

说明：将 Index 各元素转为格式化后的字符串列表，支持自定义格式化函数。

importpandasaspd 日期=pd.DatetimeIndex(["2026-05-28","2026-05-29"],name="日期")print(日期.format(formatter=lambdax:x.strftime("%Y/%m/%d")))# 输出: ['2026/05/28', '2026/05/29']

fromordinal()：序数转时间戳

Timestamp.fromordinal(cls, ordinal, tz=None)

归属：pandas.Timestamp

说明：从 Python datetime 的序数（公历日序号）构造 Timestamp。

importpandasaspd ts=pd.Timestamp.fromordinal(738613)# 大约对应 2026-05-28print(ts)# 输出: 约 2026-05-28 00:00:00

fromtimestamp()：秒数转时间戳

Timestamp.fromtimestamp(cls, ts, tz=None)

归属：pandas.Timestamp

说明：从 Unix 时间戳（秒级浮点数）构造 Timestamp。

importpandasaspd ts=pd.Timestamp.fromtimestamp(1700000000)print(ts)# 输出: 约 2023-11-15

ge()：大于等于比较

DataFrame.ge(self, other, axis: 'Axis' = 'columns', level=None) -> 'DataFrame'

归属：pandas.DataFrame

说明：逐元素比较是否 >= 给定值，返回布尔 DataFrame。

importpandasaspd 成绩表=pd.DataFrame({"语文":[85,90,78],"数学":[88,92,80]})及格=成绩表.ge(80)print(及格)# 输出:# 语文 数学# 0 True True# 1 True True# 2 False True

归属：pandas.Series

说明：逐元素比较是否 >= 给定值。

importpandasaspd s=pd.Series([120,80,150],name="销售额")print(s.ge(100))# 输出:# 0 True# 1 False# 2 True

gt()：大于比较

DataFrame.gt(self, other, axis: 'Axis' = 'columns', level=None) -> 'DataFrame'

归属：pandas.DataFrame

说明：逐元素比较是否 > 给定值。

importpandasaspd df=pd.DataFrame({"销量":[120,80,150]})print(df["销量"].gt(100).tolist())# 输出: [True, False, True]

holds_integer()：是否存整数值

Index.holds_integer(self) -> 'bool'

归属：pandas.Index

说明：判断 Index 中存储的元素是否可无损转为整数。

importpandasaspd 浮点索引=pd.Index([1.0,2.0,3.0])整数索引=pd.Index([1,2,3])print("浮点存整数:",浮点索引.holds_integer(),"整数存整数:",整数索引.holds_integer())# 输出: True True

identical()：严格相同判断

Index.identical(self, other) -> 'bool'

归属：pandas.Index

说明：判断两个 Index 是否从头到尾完全相同（比 equals 更严格，检查名称和属性）。

importpandasaspd idx1=pd.Index([1,2,3

查看全文

http://www.zskr.cn/news/1416153.html

从零打造蓝牙遥控履带车：Arduino、3D打印与FPV系统全解析

Sora 2 AI主播生成全链路拆解：从提示词工程、语音驱动到唇形同步的7大关键技术突破

选择题专练数据库原理精选30题

2025泉州除甲醛公司Top5深度测评：绿舒环保稳居榜首 - 绿舒环保母婴除甲醛

基于Raspberry Pi Pico W的Wi-Fi邮件报警系统设计与实现

Ubuntu 20.04 新手必看：刚装完系统，ifconfig和vim都用不了？5分钟搞定镜像源和常用工具安装

ZYZ28 2026.5.26 Round 记录

专业开发者指南：使用pywencai高效获取同花顺问财金融数据

Go语言跨平台数据库开发：实现跨平台数据持久化

选择题专练数据库原理精选30题[答案]

Arduino模拟信号控制实战：电位器PWM调控电机与LED

Arduino智能垃圾桶实战：超声波感应与舵机控制全解析

产品设计思维转变：从功能堆砌到问题消除，提升用户体验与留存率

深度解析DJI DroneID信号解码技术：从OFDM调制到完整解调实战指南

爷青回！用Win10和家人在家联机《龙之崛起》的保姆级教程（附1.01宽屏版资源）

5大技术革新重构缠论量化：ChanVis几何交易可视化系统

x3daudio1_7.dll 缺失导致游戏没声音或闪退？DirectX 音频组件这样查

如何快速下载三星官方固件：Bifrost跨平台固件管理完整指南

如何快速实现专业级数字动画：CountUp.js 完整解决方案

OpenHuman 本地 AI 桌面管家｜从零部署

对比按需计费与Token Plan套餐在长期项目中的成本体感差异

终极Steam数据获取指南：GetDataFromSteam-SteamDB完整教程

Amphenol ICC MSPEC6P2AK010线束组件解析及替代方案参考

NAT网关（一）NAT网关与西门子PN/PN耦合器的本质区

rabbitmq（2）：消息可靠性与 SpringAMQP 实战总结

从查重到消 AI 痕，Paperxie 如何解决论文毕业季的两大核心痛点

5个技巧让B站视频下载效率翻倍：哔哩下载姬downkyi完全指南

全国钢模板厂家实测排行：基于工程场景的性能与服务对比 - 奔跑123

Claude长文本处理卡顿诊断指南（含火焰图分析+KV Cache内存泄漏定位工具链）

AbMole 小讲堂丨Artemisinin：青蒿素在氧化应激与铁代谢研究中的应用

python-pandas速查文档（3）

文章目录

六、公开方法速查（按功能分类）

equals()：对象相等判断

eval()：表达式计算

factorize()：因子编码

floor()：时间向下取整

floordiv()：整除运算

flush()：刷新到磁盘

format()：格式化输出

fromordinal()：序数转时间戳

fromtimestamp()：秒数转时间戳

ge()：大于等于比较

gt()：大于比较

holds_integer()：是否存整数值

identical()：严格相同判断

相关文章：