KerasNLP


KerasNLP is a simple and powerful API for building Natural Language Processing (NLP) models within the Keras ecosystem.

KerasNLP provides modular building blocks following standard Keras interfaces (layers, metrics) that allow you to quickly and flexibly iterate on your task. Engineers working in applied NLP can leverage the library to assemble training and inference pipelines that are both state-of-the-art and production-grade.

KerasNLP can be understood as a horizontal extension of the Keras API: components are first-party Keras objects that are too specialized to be added to core Keras, but that receive the same level of polish as the rest of the Keras API.

KerasNLP is also new and growing! If you are interested in contributing, please check out our contributing guide.


KerasNLP requires Python 3.7+ and TensorFlow 2.9+.

Install the latest release:

pip install keras-nlp --upgrade

You can check out release notes and versions on our releases page.

KerasNLP is currently in pre-release (0.y.z) development. Until version 1.0, we may break compatibility at any time and APIs should not be considered stable.
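
Because the requirements above are version floors, it can help to confirm them before running anything else. The following is a minimal sketch using only the standard library; the helper names (parse_version, meets_minimum, check_environment) are hypothetical and not part of KerasNLP, and it assumes Python 3.8 or newer for importlib.metadata. It does no more than report whether each requirement looks satisfied.

```python
import sys
from importlib import metadata


def parse_version(text):
    """Return the leading numeric components of a version string as a tuple."""
    parts = []
    for piece in text.split("."):
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        if not digits:
            break  # Stop at the first non-numeric component, e.g. "dev0".
        parts.append(int(digits))
    return tuple(parts)


def meets_minimum(installed, minimum):
    """Compare two version strings by their numeric components."""
    return parse_version(installed) >= parse_version(minimum)


def check_environment():
    """Report whether Python, TensorFlow, and keras-nlp look usable."""
    report = {"python": sys.version_info[:2] >= (3, 7)}
    # keras-nlp has no minimum here; it only needs to be installed.
    for dist, minimum in (("tensorflow", "2.9"), ("keras-nlp", None)):
        try:
            installed = metadata.version(dist)
        except metadata.PackageNotFoundError:
            report[dist] = False
            continue
        report[dist] = minimum is None or meets_minimum(installed, minimum)
    return report


if __name__ == "__main__":
    for name, ok in check_environment().items():
        print(f"{name}: {'ok' if ok else 'missing or too old'}")
```

Since APIs may change before 1.0, pinning the exact keras-nlp version you tested against is a reasonable precaution.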

Quick introduction

The following snippet will tokenize some text, build a tiny transformer, and train a single batch.

import keras_nlp
import tensorflow as tf
from tensorflow import keras

# Tokenize some inputs with a binary label.
vocab = ["[UNK]", "the", "qu", "##ick", "br", "##own", "fox", "."]
sentences = ["The quick brown fox jumped.", "The fox slept."]
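tokenizer = keras_nlp.tokenizers.WordPieceTokenizer(
    vocabulary=vocab,
    sequence_length=10,  # Pad or truncate every sentence to 10 tokens.
)
x, y = tokenizer(sentences), tf.constant([1, 0])

# Create a tiny transformer. The sizes below are small, illustrative values.
inputs = keras.Input(shape=(None,), dtype="int32")
outputs = keras_nlp.layers.TokenAndPositionEmbedding(
    vocabulary_size=len(vocab),
    sequence_length=10,
    embedding_dim=16,
)(inputs)
outputs = keras_nlp.layers.TransformerEncoder(
    num_heads=4,
    intermediate_dim=32,
)(outputs)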
outputs = keras.layers.GlobalAveragePooling1D()(outputs)
outputs = keras.layers.Dense(1, activation="sigmoid")(outputs)
model = keras.Model(inputs, outputs)

# Run a single batch of gradient descent.
model.compile(optimizer="rmsprop", loss="binary_crossentropy", jit_compile=True)
model.train_on_batch(x, y)
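
The vocabulary in the snippet above uses the WordPiece convention in which a "##" prefix marks a piece that continues a word. To show how a word such as "quick" ends up as "qu" followed by "##ick", here is a simplified, pure-Python sketch of greedy longest-match-first splitting. It is an illustration only, not the library's implementation: the function name wordpiece is hypothetical, and it assumes input that has already been lowercased and split into words.

```python
def wordpiece(word, vocab, unk="[UNK]", prefix="##"):
    """Split one word into WordPiece tokens by greedy longest match."""
    pieces = []
    start = 0
    while start < len(word):
        end = len(word)
        match = None
        # Try the longest remaining substring first, then shrink it.
        while end > start:
            candidate = word[start:end]
            if start > 0:
                candidate = prefix + candidate  # Continuation piece.
            if candidate in vocab:
                match = candidate
                break
            end -= 1
        if match is None:
            # No piece fits at this position: the whole word is unknown.
            return [unk]
        pieces.append(match)
        start = end
    return pieces


vocab = {"[UNK]", "the", "qu", "##ick", "br", "##own", "fox", "."}
words = ["the", "quick", "brown", "fox", "jumped", "."]
tokens = [piece for word in words for piece in wordpiece(word, vocab)]
print(tokens)
```

In this sketch, a word with no matching pieces, such as "jumped", collapses to a single "[UNK]" token, which is why the tiny vocabulary above includes that entry.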

To see an end-to-end example using KerasNLP, check out our guide on pre-training a transformer from scratch.

Citing KerasNLP

If KerasNLP helps your research, we appreciate your citations. Here is the BibTeX entry:

  author={Watson, Matthew and Qian, Chen and Zhu, Scott and Chollet, Fran\c{c}ois and others},