Which metric is commonly used to measure the performance of machine translation systems?
Accuracy
F1 score
BLEU score
Precision

Natural Language Processing Exercises are loading ...