Skip to content

Latest commit

 

History

History
93 lines (81 loc) · 4.59 KB

README.md

File metadata and controls

93 lines (81 loc) · 4.59 KB

CMPD Traffic Helper (Traffic Analysis for Charlotte, NC)

Build Status Python 3.4 Python 3.5 Python 3.6 License: MIT

CMPD Traffic Alerts service for persistence data and also any predictions/model analysis. Updater service as well as traditional ML model utilities.

Goals of Project

  • Data collection of all reported traffic incidents in Charlotte
  • Identify areas of improvement in Charlotte traffic flow
  • Identify problems areas, specific roads needing attention
  • Identify least likelihood route from point A to point B of having an accident

Data/APIs used:

Install Instructions

This repo has multiple projects:

  • cmpd_accidents is for persistence and storing accidents as a command-line tool
  • traffic_analyzer is for model creation/generation/testing as a command-line tool
  • cloud_predict contains samples for interacting with existing cloud models via Google Cloud AI Platform
  • traffic_analyzer_api is an API gateway for hosting a local API for development purposes

Install locally:

pip install .

Install via PyPI:

pip install charlotte-traffic-analysis

Setup

Current usage:

  1. Setup persistence for storing data (MongoDB or MySQL currently supported)
  2. Setup database table or collections as accidents
  3. Setup OpenWeatherAPI account and API key
  4. Run any command line tools using your persistence connection and weather API key

That's it! All other data is stored as reference data from the latest census information via Charlotte NC. To update reference data, download from the Charlotte Data Portal and update your install locally if needed.

To check for current accidents and store them via MongoDB:

import cmpd_accidents as cmpd
cmpd.update_traffic_data(<MongoDB host>, <OpenWeather api key>) 

It is preferable to setup a job-type service to run the API incrementally over time. Setup cron job to run every 5 minutes:

*/5 * * * * cd <your-repo-location>/cmpd_accidents && sudo python3 main.py mongodb://<user>:<password>@<host>/<databasename> <OpenWeather api key>

Predicting Accident Likelihood

To run an existing model via Google Cloud AI navigate to cloud_predict and insert a sample prediction via command-separated features:

export GOOGLE_APPLICATION_CREDENTIALS=google_client_credentials.json && python predict.py <project> <model> <test data path> <target column>

Note on Persistence

If you would rather use a relational persistence such as MySQL, the interface is already available for SQLAlchemy connect via the database module. Simply replace the "collection" argument with "table" for relational persistence. Seed scripts are available in resources/db feel free to replace with what table columns you prefer.

Persistence swap example:

from cmpd_accidents import SQLAlchemyConnect
db = SQLAlchemyConnect(connection_string='mysql+pymysql://<user>:<password>@<host>/<database>')
with self.database as db:
            exist_events = db.find_ids(table="accidents", ids=current_ids, cursor_limit=500)
with self.database as db:
                db.insert_bulk(table="accidents", items=final_data) # persist data

Tests

python setup.py test

To-Do

  • Create API to use CMPD SOAP Service for latest traffic accident data
  • Setup generic persistence for use of different databases (MySQL, etc.)
  • Add integration tests
  • Setup Travis CI integration
  • Exploratory Data Analysis
  • Analyze existing traffic prediction models and develop mock model
  • Test mock models and provide detailed transparency
  • Utilize created model to provide insight for current traffic patterns and information
  • Scale to external cloud (Google AI)
  • Create Python web service via hosting solution to call mock model and integrate with web portal
  • Finalize and push Python package to PyPI
  • Fix any new bugs
  • Create web based portal with interactivity