PEACH Lab Introduction
The purpose of this tutorial is to explain how to start working with PEACH Lab environment where users can use Jupyter notebooks to explore data and build recommendation algorithms.
To be able to complete this tutorial, you will need an access to the PEACH Lab. It is required to have EBU GitLab account and permissions to use PEACH Lab. If you are new to the PEACH contact the team to get it solved for you.
Starting PEACH Lab
Before starting your PEACH Lab environment you are given a choice for some customization:
Cloning code repositories
PEACH Lab is integrated with EBU GitLab and it is possible to clone repositories from GitLab into your Lab environment If you want to work with your repositories in PEACH Lab - now it's a good time to select them. Before that you need to tag your repositories on GitLab with label "peach-lab" (and refresh the page to get access to the new repositories):
Select repositories which you would like to be cloned to your environment during the engine start. You can choose private (your personal) or public repositories (broadcaster/team level).
Preferred way to organize the code is by creating folder
notebooks/in the root folder and placing your notebooks there, creating subfolders on per project basis when such need arises
Generation new PEACH Lab from the template
Generate new project with suggested folder structure with an already defined task and endpoint to kickstart working with the PEACH Lab platform
About the Jupyterlab
The Jupyter Notebook is an open-source web application that allows you to create and share documents that contain live code, equations, visualizations and narrative text. Uses include: data cleaning and transformation, numerical simulation, statistical modeling, data visualization, machine learning, and much more.
After spawning the environment, which may take around a minute, Jupyterlab window will have the following look:
- Git repositories you have access to. Includes common libraries, repositories selected during previous step and optionally your scaffolded repository
- Overview notebooks to view status of tasks and endpoints, notebook to validate configuration files
- To start interactive Jupyter notebook session with PEACH environment, including installed dependencies with access to Redis and Spark environments. Supports both python 2 and 3, but in tasks and endpoints only version 2 is supported right now
PEACH Lab is integrated with EBU GitLab so you can perform various operations inside git repository using UI:
- create branches
- revise changes in history
- diff notebooks
- stage changes for commit