Exploratory Data Analysis (EDA) of Portuguese "vinho verde" wine dataset, examining physicochemical properties and quality scores.

marcemir/wine

Exploratory Data Analysis (EDA) - Wine Quality

Objective

This repository contains a Jupyter notebook that presents an Exploratory Data Analysis (EDA) of a dataset consisting of physicochemical properties and quality (sensory) data for Portuguese "vinho verde" white and red wine samples.

The objective of this analysis is to understand the data and identify correlations between variables before applying a machine learning model to predict wine quality based on physicochemical characteristics.
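As a rough illustration of the correlation step described above, the sketch below computes the Pearson correlation of every numeric column against the quality score using only the standard library. The helper names (`load_rows`, `pearson`, `correlations_with_target`), the `quality` column name, and the comma delimiter are assumptions made for this example and are not taken from the notebook. In a notebook, pandas' `DataFrame.corr` would typically do the same job.

```python
import csv
from math import sqrt


def load_rows(path, delimiter=","):
    """Read a CSV file into a list of dicts (hypothetical helper).

    The delimiter is an assumption; some distributions of this dataset
    use ';' instead of ','.
    """
    with open(path, newline="") as handle:
        return list(csv.DictReader(handle, delimiter=delimiter))


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / sqrt(var_x * var_y)


def correlations_with_target(rows, target="quality"):
    """Return {column: r} for every numeric column against `target`."""
    target_values = [float(row[target]) for row in rows]
    result = {}
    for column in rows[0]:
        if column == target:
            continue
        try:
            values = [float(row[column]) for row in rows]
        except ValueError:
            continue  # skip non-numeric columns such as a wine-type label
        result[column] = pearson(values, target_values)
    return result
```

Calling `correlations_with_target(load_rows(path))` on the dataset file would then give one coefficient per physicochemical variable, which can be sorted by absolute value to see which variables move most closely with quality.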

The data was obtained from: Wine Quality Data

Input Variables - Physicochemical Analysis

  1. Fixed Acidity: The concentration of non-volatile acids, primarily tartaric, malic, and citric acids, contributing to the wine's tartness and stability.
  2. Volatile Acidity: The concentration of volatile acids, chiefly acetic acid; at high levels it gives the wine an unpleasant, vinegar-like aroma.
  3. Citric Acid: A minor acid in wine that can add freshness and enhance the fruity flavors.
  4. Residual Sugar: The sugar remaining after fermentation, influencing sweetness and body.
  5. Chlorides: The chloride ion concentration, which can affect the taste and stability of the wine.
  6. Free Sulfur Dioxide: The portion of sulfur dioxide that is not bound to other compounds, acting as an antimicrobial and antioxidant.
  7. Total Sulfur Dioxide: The total amount of sulfur dioxide, both free and bound, used as a preservative.
  8. Density: The mass per unit volume of the wine, related to the alcohol and sugar content.
  9. pH: The measure of the acidity or basicity of the wine, affecting taste, color, and microbial stability.
  10. Sulphates: The concentration of sulphate compounds, which contribute to wine stability and preservation and can also influence mouthfeel.
  11. Alcohol: The ethanol content resulting from fermentation, affecting the wine's body, warmth, and preservation.
  12. Type: A categorical variable indicating whether the sample is a red or a white wine.
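Before any analysis, it can help to confirm that the file actually contains the variables listed above. The following is a minimal sketch of such a check. The exact column spellings in `EXPECTED_COLUMNS` and the `check_columns` helper are assumptions for illustration, so they should be adjusted to match the header of the real file.

```python
# Column names are assumed from the variable list in this README; the
# header of the actual CSV file may spell them differently.
EXPECTED_COLUMNS = [
    "fixed acidity",
    "volatile acidity",
    "citric acid",
    "residual sugar",
    "chlorides",
    "free sulfur dioxide",
    "total sulfur dioxide",
    "density",
    "pH",
    "sulphates",
    "alcohol",
    "type",
    "quality",
]


def check_columns(header):
    """Return (missing, unexpected) column names, compared case-insensitively."""
    expected = {name.lower() for name in EXPECTED_COLUMNS}
    found = {name.strip().lower() for name in header}
    missing = sorted(expected - found)
    unexpected = sorted(found - expected)
    return missing, unexpected
```

An empty `missing` list means every input and output variable is present; anything in `unexpected` is a column the list above does not describe.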

Output Variable - Sensory Data

  1. Quality: A sensory-based score ranging from 0 to 10, reflecting the overall perceived quality of the wine based on taste, aroma, balance, and overall enjoyment.
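Because the quality score is the prediction target, a common first look is how the scores are distributed for each wine type. The sketch below tallies them with the standard library. The function name and the `type` and `quality` column names are assumptions for this example, and the rows are expected as dicts such as those produced by `csv.DictReader`.

```python
from collections import Counter, defaultdict


def quality_counts_by_type(rows, type_key="type", quality_key="quality"):
    """Count how many samples received each quality score, per wine type."""
    counts = defaultdict(Counter)
    for row in rows:
        # Scores are assumed to be whole numbers, possibly stored as "6" or "6.0".
        score = int(float(row[quality_key]))
        counts[row[type_key]][score] += 1
    # Sort scores so each per-type histogram reads from low to high.
    return {
        wine_type: dict(sorted(counter.items()))
        for wine_type, counter in counts.items()
    }
```

A strongly uneven distribution here would suggest checking class balance before training a model on the scores.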

Acknowledgments

