Question 56:
You work for a language translation software company. Your company needs to move from traditional translation software to a machine learning model-based approach that produces accurate translations. One of your first tasks is to take text supplied as a document and use a histogram to measure how often each individual word occurs in the document, for use in document classification. Which of the following text feature engineering techniques is the best solution for this task?
Answer options:
A. Orthogonal Sparse Bigram (OSB)
B. Term Frequency-Inverse Document Frequency (tf-idf)
C. Bag-of-Words
D. N-Gram
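
For illustration only, the word-occurrence histogram described in the question can be sketched in Python as shown here. This is a minimal sketch under stated assumptions, not the method of any particular library or service: the helper name `word_histogram`, the lowercase normalization, and the simple regex tokenizer are all hypothetical choices made for the example.

```python
import re
from collections import Counter


def word_histogram(text: str) -> Counter:
    """Count how often each individual word occurs in `text`.

    Hypothetical helper for illustration; the tokenization rules
    below are assumptions, not part of the original question.
    """
    # Assumption: lowercase everything and treat runs of letters,
    # digits, or apostrophes as words, discarding other punctuation.
    tokens = re.findall(r"[a-z0-9']+", text.lower())
    return Counter(tokens)


if __name__ == "__main__":
    document = "The cat sat on the mat. The mat was flat."
    histogram = word_histogram(document)
    # Print each distinct word with its occurrence count,
    # most frequent first.
    for word, count in histogram.most_common():
        print(word, count)
```

The resulting counts ignore word order entirely, so each document is reduced to a mapping from word to frequency that a classifier can consume as features.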