Machine Learning

At Plank, we’re committed to creating the next generation of engineers who are as obsessed with computer science as we are.

Machine Learning

sprint cycle

The Plank machine learning lab consists of dedicated teams of data scientists, data engineers and mlops engineers that work with innovative technology companies on their most difficult challenges.

Step 1
Business problem formulation
  • Interviewing stakeholders and documenting research
  • Objective formulation of business problem along with key metrics
  • Defining successful outcome concretely
Excel,Wiki
Step 2
Data cleaning and data pipeline engineering
  • Consolidation of input data sources and schema
  • Data analysis to assess missing data, mistyped data
  • Creation of consolidated / clean schema of data
  • Creating data pipelines 
  • Deploying data pipeline and batch jobs to populate data lake
ETL,DBT,Apache Spark,Snowflake
Step 3
Data Science & Model training
  • Creating training, test, validation datasets
  • Formulating model experiments and tracking model performance
  • Tuning of model hyper-parameters
  • Optimization of model performance
Pytorch,Pandas,Scikit-learn,Tensorflow
Step 4
ML model coding and deployment
  • Implementation of prediction API endpoints
  • Creation and deployment of model microservices
  • Tracking of model performance
  • Setup regular model retraining
  • Model versioning
MLFlow,Kubernetes,Kubeflow,Sagemaker
Step 5
Model AB testing
  • Integration with data warehouse and analytics services for continuous model performance evaluation
  • Defining model variants and AB testing experiments
  • Mapping model performance to key business metrics
Python,SplitIO
Step 6

Customer Showcase

Achaemenid

Achaemenid
Machine Learning

Achaemenid is a medical technology company that develops innovative products for non-invasive diagnostics, therapeutic, as well as those for management of patients with mild to severe obstructive sleep apnea. Founded by Reza Radmand, DMD, Achaemenid is focused on patient-centric healthcare-related solutions.

junction-ai

junction-ai
Machine Learning

Junction AI fine tunes the last mile of the supply chain for successful, predictable, and profitable product merchandising. AI success depends on the right infrastructure, tools & data modeling. Junction AI’s proprietary AI/ML platform is an end-to-end data transformation solution. From ingesting structured & unstructured data, cleansing & preprocessing, modeling & insight delivery we’ve combined everything you need for AI/ML success.

Migo

Migo
Machine Learning

Migo is an embedded lending platform that enables companies to extend credit to consumers and small businesses in their own apps. Migo builds proprietary ML algorithms to assess credit risk using the company’s data then automates credit facilities via cutting-edge cloud infrastructure, simplifying the complex world of lending with a simple API. Raised $50million from TPG, Valor Capital, and Nyca Partners

Neuruno

Neuruno
Machine Learning

Engineers we’ve worked with

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Ut morbi bibendum interdum sit. Tincidunt magna nisi, a amet sit enim dui accumsan. Phasellus in nunc gravida.

Matias Lee

Machine Learning
Machine Learning
Matias Lee

PhD in Computer Science from National University of Cordoba

  • ex-Instructor of Machine Learning at National University of Cordoba
  • 10 years experience in data science and machine learning
  • Chief Data Science Officer in multiple companies
  • Publications: https:// scholar.google.com/citations? user=eWYixqsAAAAJ&hl=en
Data Science

Data Science

Accelerate your career in data science by building a strong foundation in cutting-edge technologies in the field, learning software development best practices, and working closely with senior engineers on real world data science projects.

Technology

  • Learn the basis of data manipulation, visualization, and machine learning
  • Learning about feature engineering, mutual information, clustering, principle component analysis, encodings
  • Creating, reading and writing data frames. Indexing, selecting, and assigning. Summary functions and maps. Grouping and sorting
  • Deployment of models, creating APIs to serve predictions
  • Learn about scikit-learn pipelines, numpy, scipy, FastAPI, Docker
  • How to use Pandas for data analysis
  • How to use Tensorflow to solve regression problems
  • End to end design, training, development and deployment of ML models

Project Management & Tools

  • Learn about different software methodologies like scrum, kanban
  • Learn about how to create a roadmap for a product milestone
  • Learn how to use different project management tools like Jira, Slack, Github, Notion etc.
  • Learn how to structure your communication to convey ideas clearly
  • Understand how collaboration happens in a modern distributed software development organization
  • How to use version control effectively and learn methodologies like git-flow
  • Learning test driven development
  • How to setup and use continuous integration for keeping high code quality
  • Best practices in testing across unit tests, performance tests, security tests, end to end tests, behavior driven testing, cross device testing
  • Best practices in deploying code across multiple environments
  • Best practices in setting up alerting and monitoring for production applications
  • Techniques in debugging code in development and production
Data Science

Data Science

Accelerate your career in data science by building a strong foundation in cutting-edge technologies in the field, learning software development best practices, and working closely with senior engineers on real world data science projects.

Technology

  • Learn the basis of data manipulation, visualization, and machine learning
  • Learning about feature engineering, mutual information, clustering, principle component analysis, encodings
  • Creating, reading and writing data frames. Indexing, selecting, and assigning. Summary functions and maps. Grouping and sorting
  • Deployment of models, creating APIs to serve predictions
  • Learn about scikit-learn pipelines, numpy, scipy, FastAPI, Docker
  • How to use Pandas for data analysis
  • How to use Tensorflow to solve regression problems
  • End to end design, training, development and deployment of ML models

Project Management & Tools

  • Learn about different software methodologies like scrum, kanban
  • Learn about how to create a roadmap for a product milestone
  • Learn how to use different project management tools like Jira, Slack, Github, Notion etc.
  • Learn how to structure your communication to convey ideas clearly
  • Understand how collaboration happens in a modern distributed software development organization
  • How to use version control effectively and learn methodologies like git-flow
  • Learning test driven development
  • How to setup and use continuous integration for keeping high code quality
  • Best practices in testing across unit tests, performance tests, security tests, end to end tests, behavior driven testing, cross device testing
  • Best practices in deploying code across multiple environments
  • Best practices in setting up alerting and monitoring for production applications
  • Techniques in debugging code in development and production
Data Science

Data Science

Accelerate your career in data science by building a strong foundation in cutting-edge technologies in the field, learning software development best practices, and working closely with senior engineers on real world data science projects.

Technology

  • Learn the basis of data manipulation, visualization, and machine learning
  • Learning about feature engineering, mutual information, clustering, principle component analysis, encodings
  • Creating, reading and writing data frames. Indexing, selecting, and assigning. Summary functions and maps. Grouping and sorting
  • Deployment of models, creating APIs to serve predictions
  • Learn about scikit-learn pipelines, numpy, scipy, FastAPI, Docker
  • How to use Pandas for data analysis
  • How to use Tensorflow to solve regression problems
  • End to end design, training, development and deployment of ML models

Project Management & Tools

  • Learn about different software methodologies like scrum, kanban
  • Learn about how to create a roadmap for a product milestone
  • Learn how to use different project management tools like Jira, Slack, Github, Notion etc.
  • Learn how to structure your communication to convey ideas clearly
  • Understand how collaboration happens in a modern distributed software development organization
  • How to use version control effectively and learn methodologies like git-flow
  • Learning test driven development
  • How to setup and use continuous integration for keeping high code quality
  • Best practices in testing across unit tests, performance tests, security tests, end to end tests, behavior driven testing, cross device testing
  • Best practices in deploying code across multiple environments
  • Best practices in setting up alerting and monitoring for production applications
  • Techniques in debugging code in development and production
Pytorch
Pandas
Scikit-learn
Pytorch
Pandas
Scikit-learn
Data Science
Data Science
Pytorch
Pandas
Scikit-learn
Tensorflow
MLFlow
Kubernetes
Kubeflow
Sagemaker
Python
SplitIO
Pytorch
Pandas
Scikit-learn
Tensorflow
MLFlow
Kubernetes
Kubeflow
Sagemaker
Python
SplitIO