2. CSV File ('movie_evaluation.csv') with the following format:
• Column 1: 'recordNum'
• Column 2: 'sentiment' (predicted sentiment: 0/1)
3. Written Report: Microsoft PowerPoint format.
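The CSV layout described in item 2 can be sketched as follows. This is only an illustration using Python's standard `csv` module: the function name `write_predictions` and the example predictions are hypothetical, and the brief does not say how the file must be produced.

```python
import csv
import io


def write_predictions(rows, fileobj):
    """Write (recordNum, sentiment) pairs under the two-column header."""
    writer = csv.writer(fileobj, lineterminator="\n")
    writer.writerow(["recordNum", "sentiment"])
    for record_num, sentiment in rows:
        # The brief specifies a binary label, so reject anything else.
        if sentiment not in (0, 1):
            raise ValueError(f"sentiment must be 0 or 1, got {sentiment!r}")
        writer.writerow([record_num, sentiment])


# Hypothetical predictions, for illustration only.
example_predictions = [(1, 1), (2, 0), (3, 1)]

# An in-memory buffer keeps the sketch free of side effects.
buffer = io.StringIO()
write_predictions(example_predictions, buffer)
csv_text = buffer.getvalue()
print(csv_text)
```

To produce the actual deliverable, the same function could be called with a file opened as `open('movie_evaluation.csv', 'w', newline='')` in place of the in-memory buffer.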
Assessment Criteria
1. Preprocessing of the Text [30%]
• Tokenization, stopword removal, normalization + Python program [20%]
• Text vectorization approaches
2. Machine Learning Model [40%]
• Train/test data preparation + Python programs [5%]
• Machine Learning Model Training/Testing + Python programs [25%]
• Machine Learning Model Evaluation Accuracy [10%]
3. Summary of processes and results in written format [30%]
• Discuss the model performance using the different text vectorization approaches
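The stages named in the criteria above (preprocessing, vectorization, train/test preparation, training, and accuracy evaluation) can be sketched end to end with the Python standard library alone. Everything specific here is an assumption made for illustration: the tiny stopword list, the example reviews and their hand-fixed train/test split, the nearest-centroid classifier, the smoothed TF-IDF formula, and all function names. The brief does not prescribe a model or a library, and a real submission would likely use a larger dataset and an established toolkit.

```python
import math
import re
from collections import Counter

# Assumption: a tiny illustrative stopword list, not a complete one.
STOPWORDS = {"a", "an", "the", "is", "was", "it", "this", "and", "of", "to", "with"}


def preprocess(text):
    """Normalize to lowercase, tokenize on letter runs, drop stopwords."""
    tokens = re.findall(r"[a-z']+", text.lower())
    return [t for t in tokens if t not in STOPWORDS]


def count_vector(tokens, vocabulary):
    """Bag-of-words: raw term counts in vocabulary order."""
    counts = Counter(tokens)
    return [counts[term] for term in vocabulary]


def tfidf_vector(tokens, vocabulary, doc_freq, n_docs):
    """TF-IDF with a smoothed IDF: tf * (ln((1 + n) / (1 + df)) + 1)."""
    counts = Counter(tokens)
    return [
        counts[t] * (math.log((1 + n_docs) / (1 + doc_freq[t])) + 1)
        for t in vocabulary
    ]


def cosine(u, v):
    """Cosine similarity, defined as 0.0 when either vector is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def train(vectors, labels):
    """Nearest-centroid 'model': the mean vector of each class."""
    model = {}
    for label in set(labels):
        members = [v for v, y in zip(vectors, labels) if y == label]
        model[label] = [sum(col) / len(members) for col in zip(*members)]
    return model


def predict(model, vector):
    """Return the label whose centroid is most similar to the vector."""
    return max(sorted(model), key=lambda label: cosine(model[label], vector))


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


# Hypothetical reviews with a hand-fixed train/test split (1 = positive).
train_texts = [
    "A wonderful, moving film with great acting",
    "Great story and wonderful cast",
    "A boring, terrible plot with awful acting",
    "Terrible pacing and a boring script",
]
train_labels = [1, 1, 0, 0]
test_texts = ["Wonderful acting and a great story", "An awful, boring film"]
test_labels = [1, 0]

train_tokens = [preprocess(t) for t in train_texts]
test_tokens = [preprocess(t) for t in test_texts]

# Vocabulary and document frequencies come from the training set only.
vocabulary = sorted({tok for doc in train_tokens for tok in doc})
doc_freq = Counter(tok for doc in train_tokens for tok in set(doc))
n_docs = len(train_tokens)

vectorizers = {
    "count": lambda toks: count_vector(toks, vocabulary),
    "tfidf": lambda toks: tfidf_vector(toks, vocabulary, doc_freq, n_docs),
}

# Train and evaluate once per vectorization approach.
results = {}
for name, vectorize in vectorizers.items():
    model = train([vectorize(d) for d in train_tokens], train_labels)
    predictions = [predict(model, vectorize(d)) for d in test_tokens]
    results[name] = accuracy(test_labels, predictions)

print(results)
```

Keeping the vectorizer as the only thing that changes between runs makes the per-approach accuracies in `results` directly comparable, which is the comparison the final criterion asks the report to discuss.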