This document summarizes new directions for Spark in 2015, including developing high-level interfaces for data science similar to single-machine tools, platform interfaces to plug in external data sources and algorithms, machine learning pipelines inspired by scikit-learn, a R interface for Spark, and community packages of third-party libraries. The goal is to create a unified engine for Spark that can handle a variety of data sources, workloads, and environments.