This project aims to predict the selling prices of used Volkswagen cars. The goal is to build and evaluate a predictive model using various data analysis techniques in R.
The project is part of a knowledge assessment within the framework of studies in data analysis/data science.
- Data/: Storage for raw data files (e.g., Excel, CSV). Processed versions are also saved here.
- Models/: Saved models, such as trained regression models in .rds format.
- Script/: All R scripts, organized by purpose:
00_packages.R: Setup and installation of required packages01_data_cleaning.R: Data preparation and cleaning02_eda.R: Exploratory data analysis03_data_modeling.R: Creating the regression models04_model_diagnostics.R: Evaluating model performance05_model_selection.R: Comparing and selecting the best model06_model_testing.R: Testing the selected model07_predict_new_data.R: Using the model for predictionsSCB API: Use to download data
- Visualizations/: Finalized graphs and charts (PNG or PDF format).
# Core data manipulation and visualization library(tidyverse) library(ggplot2) library(readxl) library(stringr) # Data retrieval library(httr) library(jsonlite) library(pxweb) library(lubridate) # Analysis and modeling library(corrplot) library(caret) library(MASS) library(lmtest) library(car)See Script/install_packages.R for installation of necessary packages.
- Open
Kunskapskontroll.Rprojin RStudio. - Run the scripts in
Script/in the following order:00_packages.R01_data_cleaning.R02_eda.R03_data_modeling.R04_model_diagnostics.R05_model_selection.R06_model_testing.R07_predict_new_data.R
For questions, please contact: [email protected]