Building an NBA betting model using Google Cloud Platform
With the NBA season coming to a close, the timing could not be worse - I have developed an NBA betting model web app. Although I don’t expect this model to ‘beat the books’, it’s an interesting academic exercise. It has been a good reason to dive deeper into some key software engineering practices.
The model is based largely on Dean Oliver’s Four Factors - the most important statistics that determine who will win a basketball game. Shooting, turnovers, rebounding and free throws. Other advanced team and player statistics will be included in the future to hopefully produce a more accurate model.
The NBA betting model is built using Google Cloud Platform and is completely serverless. NBA team and schedule data is scraped using python scripts taking advantage of packages such as BeautifulSoup
and are run using GCP’s Cloud Functions. The data is then processed to GCS buckets and subsequently BigQuery. The model is then written in SQL using BigQuery - with a scheduled query outputting json
to another GCS bucket via another Cloud Function invoked by a Pub/Sub Topic on a daily run. From here a static Vue website is hosted via Firebase with a compontent serving the json
via an Axios HTTP request.
This architecture does not use any compute resources and is unlikely to incur any billing costs in GCP.
I plan on writing additional posts detailing the code to scrape the NBA data, build the relevant pipelines and serve the ultimate line for daily matchups.