← Back to all frameworks Machine Learning

Pandas + NumPy

The foundation of every Python data project

What it is

Pandas for tabular dataframes; NumPy for vectorized math. Together they're the data backbone — every model starts with these two doing the wrangling.

How Vaaani uses it

  • ETL: load CSV/Parquet/SQL → clean → join → save
  • Exploratory data analysis with groupby and pivot
  • Feature engineering: rolling windows, lags, encodings
  • Vectorized numerical pre-processing before model.fit()

Why it makes the cut

You can't do ML without these. The faster you read pandas idioms, the faster every project ships.

Sample code

import pandas as pd

df = pd.read_csv("orders.csv")
revenue = (df.groupby("region")["amount"]
             .sum()
             .sort_values(ascending=False))

Related in the Vaaani stack

Have a project that needs Pandas?

30-min discovery call. You describe the busywork; I map it to an AI worker and a budget.