Skip to content

PEACH Lab Introduction

The purpose of this tutorial is to explain how to start working with PEACH Lab environment where users can use Jupyter notebooks to explore data and build recommendation algorithms.

Prerequisites

To be able to complete this tutorial, you will need an access to the PEACH Lab. It is required to have EBU GitLab account and permissions to use PEACH Lab. If you are new to the PEACH contact the team to get it solved for you.

Starting PEACH Lab

First you need to set up your notebook engine on PEACH platform, where you first sign in using GitLab account.

Spawner options

Before starting your PEACH Lab environment you are given a choice for some customization:

Choosing repos

  1. Cloning code repositories
    PEACH Lab is integrated with EBU GitLab and it is possible to clone repositories from GitLab into your Lab environment If you want to work with your repositories in PEACH Lab - now it's a good time to select them. Before that you need to tag your repositories on GitLab with label "peach-lab" (and refresh the page to get access to the new repositories):

    Tagging repos

    Select repositories which you would like to be cloned to your environment during the engine start. You can choose private (your personal) or public repositories (broadcaster/team level).

    Preferred way to organize the code is by creating folder notebooks/ in the root folder and placing your notebooks there, creating subfolders on per project basis when such need arises

  2. Generation new PEACH Lab from the template template

    Generate new project with suggested folder structure with an already defined task and endpoint to kickstart working with the PEACH Lab platform

About the Jupyterlab

The Jupyter Notebook is an open-source web application that allows you to create and share documents that contain live code, equations, visualizations and narrative text. Uses include: data cleaning and transformation, numerical simulation, statistical modeling, data visualization, machine learning, and much more.

After spawning the environment, which may take around a minute, Jupyterlab window will have the following look:

Jupyterlab overview

  1. Git repositories you have access to. Includes common libraries, repositories selected during previous step and optionally your scaffolded repository
  2. Overview notebooks to view status of tasks and endpoints, notebook to validate configuration files
  3. To start interactive Jupyter notebook session with PEACH environment, including installed dependencies with access to Redis and Spark environments. Supports both python 2 and 3, but in tasks and endpoints only version 2 is supported right now

Git integration

PEACH Lab is integrated with EBU GitLab so you can perform various operations inside git repository using UI:

  • pull
  • create branches
  • revise changes in history
  • diff notebooks
  • stage changes for commit
  • commit
  • push

Jupyterlab overview