How can I get the names of the features after applying OneHotEncode in ColumnTransformer using Sklearn Pipeline?

answered 2023-02-09 17:00:00 +0000

plato
56 ●5 ●5

You can access the names of the features generated by OneHotEncode in ColumnTransformer using the get_feature_names() method. Here's an example:

from sklearn.compose import ColumnTransformer
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OneHotEncoder

# Define ColumnTransformer to apply OneHotEncoder to categorical features
preprocessor = ColumnTransformer(transformers=[('cat', OneHotEncoder(), ['gender', 'education'])])

# Define pipeline to apply preprocessor and other steps
pipeline = Pipeline(steps=[('preprocessor', preprocessor), ('classifier', LogisticRegression())])

# Fit pipeline to data
pipeline.fit(X_train, y_train)

# Get the names of the generated features
feature_names = pipeline.named_steps['preprocessor'].transformers_[0].named_steps['onehotencoder'].get_feature_names(['gender', 'education'])
print(feature_names)

In the example above, get_feature_names() is used to get the names of the generated features for the columns gender and education. The resulting feature_names list will contain the names of the generated features in the order they were created by the OneHotEncoder.

edit flag offensive delete link

add a comment

Your Answer

Please start posting anonymously - your entry will be published after you log in or create a new account. This space is reserved only for answers. If you would like to engage in a discussion, please instead post a comment under the question or an answer that you would like to discuss

Add Answer

How can I get the names of the features after applying OneHotEncode in ColumnTransformer using Sklearn Pipeline?

1 Answer

Your Answer

Question Tools

Stats

Related questions

How can I get the names of the features after applying OneHotEncode in ColumnTransformer using Sklearn Pipeline? edit

1 Answer