πκ΄μ¬λΆμΌ
- Hadoop, Spark, Kafka, Docker ...
- λμ©λ λ°μ΄ν° μ²λ¦¬, λΆμ° μμ€ν , λ°μ΄ν° λΆμ
πΊλΈλ‘κ·Έ λ°μ΄ν° μμ§λμ΄κ° λκΈ°μν΄ κ³΅λΆνκ³ μλ λͺ¨λ κ²!!
πνλ‘μ νΈ μΉ κ°λ°/λ°μ΄ν° λΆμ/μΆμ²μμ€ν /λ₯λ¬λ λ±
πΈCS μλ£κ΅¬μ‘°/μκ³ λ¦¬μ¦/μ»΄ν¨ν°κ΅¬μ‘°/μ΄μ체μ /λ€νΈμν¬/λ°μ΄ν°λ² μ΄μ€ λ±
π§λ Όλ¬Έλ¦¬λ·°
- Piranha : Optimizing Short Jobs in Hadoop, Elmeleegy K
- Robert H Bonczek, Clyde W Holsapple, and Andrew B Whinston. Foundations of decision support systems. Academic Press, 2014.
- Yingyi Bu, Bill Howe, Magdalena Balazinska, and Michael D Ernst. Haloop: efficient iterative data processing on large clusters. Proceedings of the VLDB Endowment,
- An Experimental Comparison of Pregel-like, Systems G Han M Daudjee K Ammar KOzsu M Wang X Jin T
- Twister : A Runtime for Iterative MapReduce, Ekanayake J Li H Zhang B Gunarathne TBae S Qiu J Fox G
- The Hadoop Distributed File System, Shvachko K Kuang H Radia S Chansler
- MapReduce : Simplified Data Processing on Large Clusters, Dean J Ghemawat S
- Jeffrey Dean and Sanjay Ghemawat. Mapreduce: simplified data processing on large clusters. Communications of the aCM, 51(1):107β113, 2008.
- Hive: a warehousing solution over a map-reduce framework. Proceedings of the VLDB
- MapReduce Online, Condie T Conway N Alvaro P Hellerstein JElmeleegy K Sears R
- PACMan: Coordinated memory caching for parallel jobs, Ananthanarayanan G Ghodsi A Wang A
- Hive: a warehousing solution over a map-reduce framework
- Resilient Distributed Datasets : A Fault-Tolerant Abstraction for In-Memory Cluster Computing, Zaharia M Chowdhury M Das T Dave A Ma JMccauley M
- Flink Forward conference in Berlin. Flink vs spark slideshare. http://www.slideshare.net/sbaltagi/flink-vs-spark? related=2.
- Resilient Distributed Datasets : A Fault-Tolerant Abstraction for In-Memory Cluster Computing, Zaharia M Chowdhury M Das T Dave A Ma JMccauley M Franklin M
- Streaming Data Analysis using Apache Cassandra and Zeppelin
- Analysis of Hadoop performance and unstructured data using Zeppelin
- Haloop efficient iterative data processing on large clusters
- iMapReduce: A Distributed Computing Framework for Iterative Computation
- Improving MapReduce Performance in Heterogeneous Environments