求助求助

只看楼主
收藏
回复

啊飞不会飞啊
童生
2

网上搞了个代码，读取pdf中数据导出到txt中，但是导出结果中包含“NaN"和“/",想要将“NaN"和”/“替换成空格。求各位大神怎么操作。
# 事先安装好tabula,pandas：
import pandas as pd
pd.set_option('display.max_columns', 1000)
pd.set_option('display.width', 1000)
pd.set_option('display.max_colwidth', 1000)
import tabula
import sys
# 指定文件名；文件位置
file = "活页夹1.pdf"
path = 'C:/Users/邢燕飞/Desktop/' + file
#读取表格
df = tabula.read_pdf(path, pages = 'all', multiple_tables = True)
stdout=print(df)
with open('C:/Users/邢燕飞/Desktop/xxx.txt', 'w') as f:
print(df, file=f)

啊飞不会飞啊
童生
2

不感兴趣

开通SVIP免广告

悲婳煽
进士
9

df = df.replace(['/', np.nan], '')

啊飞不会飞啊
童生
2

救救孩子吧

扫二维码下载贴吧客户端

下载贴吧APP
看高清直播、视频！

贴吧页面意见反馈
违规贴吧举报反馈通道
贴吧违规信息处理公示

7回复贴，共1页

<<返回python吧

分享到:

日	一	二	三	四	五	六