This project was created for the CANIS Data Analysis Hackathon. The task was to create compelling data visualizations that would help others understand the survey results in a meaningful way. The dataset used in this project is available on Kaggle through this link.
 The purpose of this project is to analyze a dataset of news articles and identify the words that are most frequently used in true and fake news. Our goal is to gain insights into the language patterns that are associated with fake news and to develop techniques for detecting and combating misinformation.  
        To achieve this, we will perform data analysis and NLP techniques on the dataset, extract meaningful insights, and create data visualizations to present our findings in an understandable and meaningful way.  
        By identifying the key language features of fake news, we hope to contribute to the development of more effective strategies for combating the spread of misinformation and promoting the dissemination of accurate and reliable information.
The project involves data analysis of the misinformation fake news text dataset. The project includes the following steps:
The code for this project is written in Python and requires the following dependencies:
To install the dependencies, run the following commands:
pip install pandas
pip install nltk
pip install textblob
pip install matplotlib
    
  
    After Python script that simplified the datasets based on word frequency. From there, our team of data analysts used advanced techniques and R code to analyze the data and identify patterns that could distinguish between fake and true news.
To create our fake news detector, we collected all possible word sets that were likely to be associated with fake news and used this data to make the detector. Now, if a news article contains a certain number of these words, our detector can quickly flag it as fake news. It's like having a trusty assistant by our side, helping us navigate the murky waters of information overload and stay vigilant against the spread of false information! 🕵️♀️📰