PandasAI in Python: A Detailed Guide to the Library's Introduction, Installation, and Usage
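Introduction to the PandasAI library
PandasAI is a Python library that adds generative AI capabilities to pandas, the popular data analysis and manipulation tool. It lets you ask questions about your data in natural language. PandasAI is designed to be used alongside pandas, not as a replacement for it.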
Installing the PandasAI library
pip install pandasai
pip install -i https://pypi.tuna.tsinghua.edu.cn/simple pandasai
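To confirm that the installation succeeded, one option is to look up the installed version with the standard library. This is a minimal sketch; the helper name `installed_version` is hypothetical and not part of PandasAI.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(package: str):
    """Return the installed version string of `package`, or None if it is absent."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_version("pandasai")
    print(f"pandasai version: {found}" if found else "pandasai is not installed")
```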
Using the PandasAI library
1. Basic usage
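(1) Asking a question about a DataFrame
You can ask PandasAI, for example, to find all rows in a DataFrame where a column value is greater than 5, and it returns a DataFrame containing only those rows. The example below asks a question of the same kind: which five countries have the highest happiness index.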
import pandas as pd
from pandasai import PandasAI
from pandasai.llm.openai import OpenAI

# Sample DataFrame
df = pd.DataFrame({
    "country": ["United States", "United Kingdom", "France", "Germany", "Italy", "Spain", "Canada", "Australia", "Japan", "China"],
    "gdp": [19294482071552, 2891615567872, 2411255037952, 3435817336832, 1745433788416, 1181205135360, 1607402389504, 1490967855104, 4380756541440, 14631844184064],
    "happiness_index": [6.94, 7.16, 6.66, 7.07, 6.38, 6.4, 7.23, 7.22, 5.87, 5.12]
})

# Instantiate an LLM (replace the placeholder with your own OpenAI API key)
llm = OpenAI(api_token="YOUR_API_TOKEN")

# Wrap the LLM, then ask a question about the DataFrame in natural language
pandas_ai = PandasAI(llm)
pandas_ai(df, prompt='Which are the 5 happiest countries?')
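For comparison, both the question asked above and the "greater than" filter mentioned earlier correspond to ordinary pandas operations. The sketch below shows what a correct answer looks like in plain pandas. It is not the code PandasAI generates, and the filter column (`happiness_index`) and threshold (7) are assumptions chosen for illustration.

```python
import pandas as pd

df = pd.DataFrame({
    "country": ["United States", "United Kingdom", "France", "Germany", "Italy",
                "Spain", "Canada", "Australia", "Japan", "China"],
    "happiness_index": [6.94, 7.16, 6.66, 7.07, 6.38, 6.4, 7.23, 7.22, 5.87, 5.12],
})

# "Which are the 5 happiest countries?" -> the five largest happiness_index values
top5 = df.nlargest(5, "happiness_index")["country"].tolist()
print(top5)

# Rows where a column value exceeds a threshold (threshold of 7 is an assumption)
above_7 = df[df["happiness_index"] > 7]
print(above_7)
```

Checking a generated answer against a hand-written query like this is a simple way to validate the output on small data.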
(2) Plotting
pandas_ai(
df,
"Plot the histogram of countries showing for each the gdp, using different colors for each bar",
)
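A prompt like this would typically be answered with a matplotlib bar chart. The following is a sketch of an equivalent hand-written plot, not PandasAI's generated code; the `tab10` colormap, the axis labels, and the output file name in the comment are assumptions.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend so the script runs without a display
import matplotlib.pyplot as plt
import pandas as pd

df = pd.DataFrame({
    "country": ["United States", "United Kingdom", "France", "Germany", "Italy",
                "Spain", "Canada", "Australia", "Japan", "China"],
    "gdp": [19294482071552, 2891615567872, 2411255037952, 3435817336832,
            1745433788416, 1181205135360, 1607402389504, 1490967855104,
            4380756541440, 14631844184064],
})

# One distinct color per bar, taken from the 10-color "tab10" palette
colors = [plt.cm.tab10(i) for i in range(len(df))]

fig, ax = plt.subplots(figsize=(10, 5))
bars = ax.bar(df["country"], df["gdp"], color=colors)
ax.set_ylabel("GDP")
ax.set_title("GDP by country")
ax.tick_params(axis="x", rotation=45)
fig.tight_layout()
# To keep the figure, call fig.savefig("gdp_by_country.png"),
# or use plt.show() with an interactive backend.
```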
2. Advanced usage
(1) Passing multiple DataFrames to PandasAI and asking questions that relate them
import pandas as pd
from pandasai import PandasAI
from pandasai.llm.openai import OpenAI

employees_data = {
    'EmployeeID': [1, 2, 3, 4, 5],
    'Name': ['John', 'Emma', 'Liam', 'Olivia', 'William'],
    'Department': ['HR', 'Sales', 'IT', 'Marketing', 'Finance']
}
salaries_data = {
    'EmployeeID': [1, 2, 3, 4, 5],
    'Salary': [5000, 6000, 4500, 7000, 5500]
}
employees_df = pd.DataFrame(employees_data)
salaries_df = pd.DataFrame(salaries_data)

# No api_token is passed here, so the key is expected to come from the
# OPENAI_API_KEY environment variable
llm = OpenAI()
pandas_ai = PandasAI(llm)

# Pass both DataFrames as a list so the question can draw on each of them
pandas_ai([employees_df, salaries_df], "Who gets paid the most?")
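Answering this question requires joining the two tables on their shared `EmployeeID` column. The sketch below shows the equivalent query in plain pandas, as a reference for what a correct answer looks like; it is not the code PandasAI generates.

```python
import pandas as pd

employees_df = pd.DataFrame({
    "EmployeeID": [1, 2, 3, 4, 5],
    "Name": ["John", "Emma", "Liam", "Olivia", "William"],
    "Department": ["HR", "Sales", "IT", "Marketing", "Finance"],
})
salaries_df = pd.DataFrame({
    "EmployeeID": [1, 2, 3, 4, 5],
    "Salary": [5000, 6000, 4500, 7000, 5500],
})

# Join the two tables on the shared key, then pick the row with the highest salary
merged = employees_df.merge(salaries_df, on="EmployeeID")
top = merged.loc[merged["Salary"].idxmax()]
print(top["Name"], top["Salary"])  # -> Olivia 7000
```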