kumoai.encoder.Hash#

class kumoai.encoder.Hash[source]#

Bases: Encoder

A Hash encoder encodes its corresponding categorical column by hashing each value to range [0..num_components], and using this hashed value to determine the corresponding embedding (with size channels from the model plan).

num_components: PositiveInt#

The number of distinct categories after hashing.

na_strategy: Literal[separate, most_frequent] = 'separate'#

The specified null value imputation strategy.

__init__(num_components, na_strategy=NAStrategy.SEPARATE, name='Hash', _stats=<factory>, _target_='kumo.encoder.categorical.Hash')#