Theano is an open-source library that allows you to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. Theano employs a BSD 3-Clause license.

In the sections below, we provide example notebooks to demonstrate how to install Theano on Databricks using Cluster Node Initialization Scripts and how to run example Theano programs. See Integrating Deep Learning Libraries with Apache Spark for an example of integrating a deep learning library with Spark.


This guide is not a comprehensive guide on Theano. Refer to the Theano website.

Install Theano using an init script

Databricks recommends using Cluster Node Initialization Scripts to install Theano to make it available on all cluster nodes. The example notebook below installs an init script that installs Theano and enables GPU features.

Use Theano on a single node

To test and migrate single-machine Theano workflows, you can start with a driver-only cluster on Databricks by setting the number of workers to zero. Though Apache Spark is not functional under this setting, it is a cost-effective way to run single-machine Theano workflows.