Ask Your Question

Revision history [back]

Yes, it is possible to apply label encoding to a pandas dataframe while ensuring that the same label is assigned to the same value. This can be done using the LabelEncoder from the scikit-learn library.

First, initialize the LabelEncoder object and fit it to the column(s) of the dataframe using the fit method:

from sklearn.preprocessing import LabelEncoder

encoder = LabelEncoder()
encoder.fit(df['column_name'])

Next, use the transform method to encode the column(s):

df['encoded_column'] = encoder.transform(df['column_name'])

This will assign a unique numerical label to each unique value in the column, ensuring that the same label is assigned to the same value.