Giter Site home page Giter Site logo

안녕하세요! 기본기가 탄탄한 데이터 엔지니어 유정수입니다. 😊

💗관심분야

  • Hadoop, Spark, Kafka, Docker ...
  • 대용량 데이터 처리, 분산 시스템, 데이터 분석

🌺블로그 데이터 엔지니어가 되기위해 공부하고 있는 모든 것!!

🎀프로젝트 웹 개발/데이터 분석/추천시스템/딥러닝 등

🌸CS 자료구조/알고리즘/컴퓨터구조/운영체제/네트워크/데이터베이스 등

🧁논문리뷰
  • Piranha : Optimizing Short Jobs in Hadoop, Elmeleegy K
  • Robert H Bonczek, Clyde W Holsapple, and Andrew B Whinston. Foundations of decision support systems. Academic Press, 2014.
  • Yingyi Bu, Bill Howe, Magdalena Balazinska, and Michael D Ernst. Haloop: efficient iterative data processing on large clusters. Proceedings of the VLDB Endowment,
  • An Experimental Comparison of Pregel-like, Systems G Han M Daudjee K Ammar KOzsu M Wang X Jin T
  • Twister : A Runtime for Iterative MapReduce, Ekanayake J Li H Zhang B Gunarathne TBae S Qiu J Fox G
  • The Hadoop Distributed File System, Shvachko K Kuang H Radia S Chansler
  • MapReduce : Simplified Data Processing on Large Clusters, Dean J Ghemawat S
  • Jeffrey Dean and Sanjay Ghemawat. Mapreduce: simplified data processing on large clusters. Communications of the aCM, 51(1):107–113, 2008.
  • Hive: a warehousing solution over a map-reduce framework. Proceedings of the VLDB
  • MapReduce Online, Condie T Conway N Alvaro P Hellerstein JElmeleegy K Sears R
  • PACMan: Coordinated memory caching for parallel jobs, Ananthanarayanan G Ghodsi A Wang A
  • Hive: a warehousing solution over a map-reduce framework
  • Resilient Distributed Datasets : A Fault-Tolerant Abstraction for In-Memory Cluster Computing, Zaharia M Chowdhury M Das T Dave A Ma JMccauley M
  • Flink Forward conference in Berlin. Flink vs spark slideshare. http://www.slideshare.net/sbaltagi/flink-vs-spark? related=2.
  • Resilient Distributed Datasets : A Fault-Tolerant Abstraction for In-Memory Cluster Computing, Zaharia M Chowdhury M Das T Dave A Ma JMccauley M Franklin M
  • Streaming Data Analysis using Apache Cassandra and Zeppelin
  • Analysis of Hadoop performance and unstructured data using Zeppelin
  • Haloop efficient iterative data processing on large clusters
  • iMapReduce: A Distributed Computing Framework for Iterative Computation
  • Improving MapReduce Performance in Heterogeneous Environments

youjeongsue's Projects

blog.io icon blog.io

:fire:목표는 기본이 탄탄한 데이터 엔지니어가 되는 것.

django-blog icon django-blog

클라우드 컴퓨팅 수업시간에 진행한 django blog입니다

elastic icon elastic

Elastic Stack (6.2.4) 을 활용한 Dashboard 만들기 Project

interview_question_for_beginner icon interview_question_for_beginner

:boy: :girl: Technical-Interview guidelines written for those who started studying programming. I wish you all the best. :space_invader:

khuloud icon khuloud

클라우드 컴퓨팅 수업시간에 진행한 AWS 저장소 서비스 프로젝트 입니다.

kube-prometheus icon kube-prometheus

Use Prometheus to monitor Kubernetes and applications running on Kubernetes

kubeapps icon kubeapps

A web-based UI for deploying and managing applications in Kubernetes clusters

piranha icon piranha

Piranha is a peak-caller for CLIP- and RIP-seq data

sjc-react icon sjc-react

[솔직챌린지]포즈인식 기반 비대면 실기교육 플랫폼, 최종시민투표 참여

smartfarm-dashboard icon smartfarm-dashboard

토마토 스마트팜 실시간 대시보드 - 소프트웨어융합 캡스톤디자인

website icon website

Kubernetes website and documentation repo:

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.