Get the top news, ranked by content relevancy, in 5 categories (Top Stories, Entertainment, Technology, Business, Health).
The project is built with Python and the Django framework. The complete implementation is described in the documentation folder.
- Fetch news from various sources via https://newsapi.org/
- Categorize the news with a Naive Bayes classifier (technology, entertainment, business, health)
- Sort the news by the content of the headlines and display it in the web browser
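
The fetch step above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: it assumes the NewsAPI v2 `top-headlines` endpoint and an API key of your own, and the helper names (`build_headlines_url`, `extract_headlines`, `fetch_headlines`) are hypothetical.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumption: the NewsAPI v2 top-headlines endpoint is used.
NEWSAPI_URL = "https://newsapi.org/v2/top-headlines"


def build_headlines_url(api_key, category, country="us", page_size=20):
    """Build a top-headlines request URL for one category."""
    query = urlencode({
        "category": category,
        "country": country,
        "pageSize": page_size,
        "apiKey": api_key,
    })
    return f"{NEWSAPI_URL}?{query}"


def extract_headlines(payload):
    """Return (title, source name, publishedAt) tuples from a decoded response."""
    return [
        (a.get("title"), (a.get("source") or {}).get("name"), a.get("publishedAt"))
        for a in payload.get("articles", [])
    ]


def fetch_headlines(api_key, category):
    """Perform the HTTP request (needs network access and a valid key)."""
    with urlopen(build_headlines_url(api_key, category)) as response:
        return extract_headlines(json.load(response))
```

Keeping URL construction and response parsing separate from the network call makes both easy to check without a live API key.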
Content Selection Factors:
- Age of article (publication date)
- Source Quality
- Text Quality
- Sentiment Score
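
One way these four factors could be combined into a single ranking score is a weighted sum. The sketch below is only an assumption for illustration: the weights, the 24-hour half-life, the article field names, and the requirement that each factor is scaled to [0, 1] are all hypothetical and are not taken from the project.

```python
from datetime import datetime, timedelta, timezone

# Hypothetical weights; the project does not state how the factors are combined.
WEIGHTS = {"freshness": 0.4, "source": 0.25, "text": 0.2, "sentiment": 0.15}


def freshness(published_at, now, half_life_hours=24.0):
    """Exponential decay: 1.0 for a brand-new article, 0.5 after one half-life."""
    age_hours = max((now - published_at).total_seconds() / 3600.0, 0.0)
    return 0.5 ** (age_hours / half_life_hours)


def relevancy(article, now):
    """Weighted sum of the four factors, each expected to lie in [0, 1]."""
    factors = {
        "freshness": freshness(article["published_at"], now),
        "source": article["source_quality"],
        "text": article["text_quality"],
        "sentiment": article["sentiment"],
    }
    return sum(WEIGHTS[name] * value for name, value in factors.items())


def rank(articles, now):
    """Return the articles sorted from most to least relevant."""
    return sorted(articles, key=lambda a: relevancy(a, now), reverse=True)
```

With equal source, text, and sentiment values, a newer article ranks above an older one; changing the weights shifts how much each factor matters.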
Trained Naive Bayes, SVM, and a Keras deep learning model on the dataset. The Naive Bayes classifier achieved the highest accuracy (0.89).
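
To illustrate how a Naive Bayes classifier assigns a headline to a category, here is a small pure-Python sketch of multinomial Naive Bayes with add-one smoothing. It is not the project's trained model: the class name, the tokenizer, and the eight toy headlines are made up for the example, and a real setup would more likely use a library implementation and a much larger dataset.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphabetic words."""
    return re.findall(r"[a-z]+", text.lower())


class NaiveBayes:
    """Multinomial Naive Bayes with add-one (Laplace) smoothing."""

    def fit(self, texts, labels):
        self.class_counts = Counter(labels)
        self.word_counts = defaultdict(Counter)
        for text, label in zip(texts, labels):
            self.word_counts[label].update(tokenize(text))
        self.vocab = {w for counts in self.word_counts.values() for w in counts}
        return self

    def predict(self, text):
        total_docs = sum(self.class_counts.values())
        best_label, best_score = None, -math.inf
        for label, doc_count in self.class_counts.items():
            counts = self.word_counts[label]
            denom = sum(counts.values()) + len(self.vocab)
            score = math.log(doc_count / total_docs)  # log prior
            for word in tokenize(text):
                if word in self.vocab:  # words never seen in training are skipped
                    score += math.log((counts[word] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Toy training data, invented purely for this example.
headlines = [
    ("New smartphone chip boosts battery life", "technology"),
    ("Software update fixes browser security bug", "technology"),
    ("Film festival announces award winners", "entertainment"),
    ("Pop singer releases new album", "entertainment"),
    ("Stock market rallies as bank profits rise", "business"),
    ("Company shares fall after merger talks", "business"),
    ("Doctors recommend vaccine for flu season", "health"),
    ("Study links diet to heart disease risk", "health"),
]
model = NaiveBayes().fit([t for t, _ in headlines], [c for _, c in headlines])
print(model.predict("Bank shares rise on merger news"))  # business
```

Scores are summed in log space to avoid numeric underflow when many word probabilities are multiplied together.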