matchzoo.preprocessors.bert_preprocessor

Bert Preprocessor.

Module Contents

class matchzoo.preprocessors.bert_preprocessor.BertPreprocessor(mode:str='bert-base-uncased')

Bases: matchzoo.engine.base_preprocessor.BasePreprocessor

Basic preprocessor helper.

Parameters:
  • mode – String naming the pretrained model. For the supported values, see https://huggingface.co/pytorch-transformers/pretrained_models.html.

fit(self, data_pack:DataPack, verbose:int=1)

The tokenizer is all that BertPreprocessor needs.

transform(self, data_pack:DataPack, verbose:int=1)

Apply transformation on data.

Parameters:
  • data_pack – Inputs to be preprocessed.
  • verbose – Verbosity.
Returns: Transformed data as a DataPack object.
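
The fit/transform pattern described above, where a ready-made tokenizer and vocabulary leave nothing to learn from the data, can be illustrated with a small self-contained sketch. This is not MatchZoo's implementation: ToyPretrainedPreprocessor, its constructor arguments, and the three-word vocabulary are hypothetical stand-ins, and plain lists of strings take the place of a DataPack. Returning self from fit is likewise an assumption of this sketch, made so that calls can be chained.

```python
from typing import Callable, Dict, List


class ToyPretrainedPreprocessor:
    """Hypothetical stand-in for a preprocessor backed by a fixed tokenizer.

    Not MatchZoo code: it only mirrors the fit/transform shape documented above.
    """

    def __init__(self, tokenize: Callable[[str], List[str]], vocab: Dict[str, int]):
        self._tokenize = tokenize
        self._vocab = vocab
        self._unk_id = vocab["[UNK]"]

    def fit(self, texts: List[str], verbose: int = 1) -> "ToyPretrainedPreprocessor":
        # Nothing is learned from the data: the vocabulary is fixed in advance.
        # Returning self is an assumption of this sketch.
        return self

    def transform(self, texts: List[str], verbose: int = 1) -> List[List[int]]:
        # Tokenize each text and map tokens to ids, falling back to [UNK].
        # `verbose` is accepted for signature parity but unused here.
        return [
            [self._vocab.get(token, self._unk_id) for token in self._tokenize(text)]
            for text in texts
        ]


# Toy vocabulary and a lowercasing whitespace tokenizer (both assumptions).
vocab = {"[UNK]": 0, "hello": 1, "world": 2}
preprocessor = ToyPretrainedPreprocessor(lambda s: s.lower().split(), vocab)

# fit() changes nothing, so transform() behaves the same with or without it.
ids = preprocessor.fit([]).transform(["Hello world", "hello there"])
print(ids)
```

Because fit has nothing to learn in this sketch, the same preprocessor instance can transform training and test data without being fitted on either.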