BERT inference on G4 instances using Apache MXNet and GluonNLP: 1 million requests for 20 cents

Bidirectional Encoder Representations from Transformers (BERT) [1] has become one of the mo
