How to Iterate Over Rows in a Dataframe in Pandas

2 min read

1. Using the iterrows()
2. Using the itertuples()
3. Using the apply()

Pandas is a powerful library for working with data in Python, and the DataFrame is one of its most widely used data structures. One common task when working with DataFrames is to iterate over the rows and perform some action on each row.

Here are a few different approaches for iterating over rows in a DataFrame in Pandas:

1. Using the `iterrows()`

This method returns an iterator that yields index and row data as a tuple for each row. The row data is represented as a Pandas Series.

Here is an example of how to use the iterrows() method:

import pandas as pd

df = pd.read_csv('data.csv')
for index, row in df.iterrows():
    print(row['column_name'])

2. Using the `itertuples()`

This method returns an iterator that yields namedtuples of the rows. The namedtuples have fields corresponding to the column names. This method is generally faster than iterrows() as it doesn't construct a new Pandas Series for each row.

Here is an example of how to use the itertuples() method:

import pandas as pd

df = pd.read_csv('data.csv')
for row in df.itertuples():
    print(row.column_name)

3. Using the `apply()`

This method applies a function to each row or column of the DataFrame. The function can be passed as an argument and is applied to each row, and the results are combined into a new DataFrame.

Here is an example of how to use the apply() method to iterate over rows:

import pandas as pd

df = pd.read_csv('data.csv')
def my_function(row):
    print(row['column_name'])
df.apply(my_function, axis=1)

It's important to note that when working with large datasets, iterating over rows using iterrows() or a for loop can be slow, so itertuples() and apply() are better options performance wise.

In summary, there are several approaches to iterate over rows in a DataFrame in Pandas, and the best approach will depend on the specific needs of your project. The iterrows() and itertuples() methods are easy to use and understand, while apply() method provides more control over applying a specific function to each row and the for loop is the most basic method.

PYTHON

Support Our Site

How to Iterate Over Rows in a Dataframe in Pandas

Table of Contents

1. Using the `iterrows()`

2. Using the `itertuples()`

3. Using the `apply()`

Latest Articles

How to Use Subquery() in Django With Practical Examples

DRF Serializer: Handling OrderedDict and Converting It to a Dictionary or JSON

Django Rest Framework CheetSheet: Mastering API Development

How to Perform NOT Queries in Django ORM

How to Iterate Over Rows in a Dataframe in Pandas

Table of Contents

1. Using the iterrows()

2. Using the itertuples()

3. Using the apply()

Latest Articles

How to Use Subquery() in Django With Practical Examples

DRF Serializer: Handling OrderedDict and Converting It to a Dictionary or JSON

Django Rest Framework CheetSheet: Mastering API Development

How to Perform NOT Queries in Django ORM

1. Using the `iterrows()`

2. Using the `itertuples()`

3. Using the `apply()`