
18520339 / ml-distributed-training


Distributed training with Multi-worker & Parameter Server in TensorFlow 2

Home Page: https://youtu.be/OOPVA-eqBTY

Jupyter Notebook 100.00%
distributed-tensorflow multi-workers parameter-server distributed distributed-training tensorflow multi-gpu

ml-distributed-training's Introduction

Hi, I'm Quan


  • 👨‍🎓 Master student at UTS.
  • 📚 I'm pursuing AI with the aim of developing advanced solutions to real-life problems, making people's lives more convenient, and contributing greatly to my national language.
  • 😭 I'm looking for help with Statistics.
  • ⚡ Fun fact: I love Astronomy 🛰️.
  • 🔔 Subscribe to my YouTube channel.

⚡ Technologies and Skills:


  • Web Scraping (Python, JS).
  • Web Development (MERN Stack).
  • Machine Learning (TensorFlow).
  • Network Engineering (CCNA).
  • Microsoft Office (Excel, VBA).

💻 Notable projects:

🎬 Popular YouTube Videos
Facebook crawling with Python
Jul 7, 2020
High school student management software
Feb 4, 2021
VLSP 2018 - Aspect-based Sentiment Analysis for Vietnamese reviews
Apr 15, 2020
Chat Application with Pusher Chatkit
Apr 7, 2020
Distributed Database - ArangoDB Cluster Setup
Jun 22, 2021
Excel VBA - Warehouse Management Software
Apr 8, 2020
C# Winform - Tic tac toe (Game Caro)
Apr 15, 2020

ml-distributed-training's Issues

Role of ps and worker

Hey there!
Just wanted to clarify something regarding the code for ps and worker. I've only recently started working with this kind of distributed training, so pardon my basic questions.

As far as I understand, the ps tasks serve parameters to the workers, while the workers fetch them. Aside from the difference in the tf_config, I've noticed no code for fetching or serving parameters that is dedicated only to ps or only to workers. Both share the same code.

I wanted to know how they coordinate with one another.
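
For context on the question above, here is a minimal sketch of one common TensorFlow 2 pattern, not confirmed to be what this repository does. Every process runs the same script and branches only on the `task` entry of its `TF_CONFIG`. In that pattern, ps and worker processes start a `tf.distribute.Server` and block in `server.join()`, while a separate coordinator builds the strategy and dispatches the training steps. The sketch uses only the standard library. The cluster addresses and the helper names `make_tf_config` and `role_of` are hypothetical.

```python
import json

# Hypothetical cluster layout; hosts and ports are placeholders.
CLUSTER = {
    "chief": ["localhost:2222"],
    "worker": ["localhost:2223", "localhost:2224"],
    "ps": ["localhost:2225"],
}


def make_tf_config(task_type: str, task_index: int) -> str:
    """Return the JSON string a process would export as TF_CONFIG."""
    return json.dumps(
        {"cluster": CLUSTER, "task": {"type": task_type, "index": task_index}}
    )


def role_of(tf_config: str) -> str:
    """Decide what a process does from its TF_CONFIG alone."""
    task = json.loads(tf_config)["task"]
    if task["type"] in ("worker", "ps"):
        # In TensorFlow, this is where the process would start a
        # tf.distribute.Server and block in server.join().
        return f"serve as {task['type']} {task['index']}"
    # Any other task type (here: chief) drives training.
    return "coordinate training"


if __name__ == "__main__":
    for task_type, index in [("chief", 0), ("worker", 0), ("worker", 1), ("ps", 0)]:
        print(task_type, index, "->", role_of(make_tf_config(task_type, index)))
```

Under this assumption, the fetching and serving of parameters never appears in user code: the strategy places variables on the ps tasks, and the runtime moves the values between processes.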

A question about parameter server training

Hi, your code really helps! I have one question:
In the train_dataset_fn passed to the coordinator, you use shard to split the data across workers, and the input parameter input_context.input_pipeline_id indicates the worker's index. So I think every worker should call train_dataset_fn to get its part of the data. But your code shows that only the coordinator uses train_dataset_fn.
Can you explain how input_context.input_pipeline_id works?
Thanks!
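
To make the sharding arithmetic in this question concrete, here is a small pure-Python sketch. It mimics the behaviour of `tf.data.Dataset.shard(num_shards, index)`, which keeps the elements whose position modulo `num_shards` equals `index`. The `InputContext` class below is a hypothetical stand-in holding two fields of `tf.distribute.InputContext`, and the ten-element dataset is made up. How the real strategy fills in these fields, and on which process the function runs, is not shown here.

```python
from dataclasses import dataclass


@dataclass
class InputContext:
    """Hypothetical stand-in for two fields of tf.distribute.InputContext."""
    num_input_pipelines: int
    input_pipeline_id: int


def shard(items, num_shards, index):
    """Mimic Dataset.shard: keep items whose position % num_shards == index."""
    return [x for pos, x in enumerate(items) if pos % num_shards == index]


def train_dataset_fn(input_context):
    """Return the slice of a toy dataset that belongs to one input pipeline."""
    data = list(range(10))  # made-up data for illustration
    return shard(
        data,
        input_context.num_input_pipelines,
        input_context.input_pipeline_id,
    )


if __name__ == "__main__":
    for pipeline_id in range(3):
        ctx = InputContext(num_input_pipelines=3, input_pipeline_id=pipeline_id)
        print(pipeline_id, train_dataset_fn(ctx))
```

With three pipelines, each call sees a disjoint slice, and the slices together cover the whole dataset. The function is written once and evaluated once per input pipeline, each time with a different context.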
