参数名称

Rnn_dim

128

Max_seq_length

64

Train_batch_size

64

Eval_batch_size

64

Gradient_accumulation_steps

1

Learning_rate

3e-5

Logging_steps

500