- Add a function to check if the user has entered a valid username and password. I need to add some error handling to talke care of bad values.
- Add a wrapper to the download function so it can be easily used from the command line
- Finish the function to load the data into Pandas data frames. Maybe store all the dataframes in a class or fictionary?
- Add code to create a MySQL database and load all the data.
- Create a sparse matrix storing the ICD codes for each admission ID. Figure out how to "One-Hot Encode" it for predictive modeling purposes.
- Try using PCA or Tensor Factorization to identify major disease groups.