Comments (3)
对,我这里可能没有说清楚,其实我的意思是:如果很难或者不能在上某个类型上定义正确的partition(),那么这个类型不能当作Key使用。Hadoop里面也是一样,一些自定的class可以当作key使用,但如果无法在上面定义partition(),那么就不能当作key。
from sparkinternals.
那段注释是java数组的hashcode是基于数组的标示符而不是基于数组内容(元素),使用array作为分区键会产生不可预料不正确的结果。其实不光是数组,map作为分区键也不合适。这个东西可以引申一下,如果作为分区键的class很难产生value一致排序的分区,则不适合做分区键
from sparkinternals.
@lqian 谢谢建议,下次修改时加上
from sparkinternals.
Related Issues (20)
- Shuffle details的一点建议 HOT 2
- 一个问题,就是Spark是不是能把所有传入作为参数的函数都分布式进行计算?对吗? HOT 1
- Narrow dependencies-第二章第二节图FullDependency: N : N HOT 5
- readme.md的content目录,英文版的link不统一 HOT 2
- 关于CogroupRDD的一点疑问以及依赖的一点问题 HOT 2
- 第一章配置多个 CoarseGrainedExecutorBackend 进程 HOT 2
- 关于cache()的疑问 HOT 1
- 关于partitioner的疑问
- 关于第三章第二幅图的理解 HOT 4
- 请教个关于spark具体应用设计问题 HOT 3
- Spark 2.0 Content HOT 2
- java.lang.UnsupportedOperationException: Cannot evaluate expression: PythonUDF#Grappra(input[410, StringType])
- reduceByKey 函数 map 端 combine 的实现变化
- 第二章 JobLogicaPlan.pdf 存在笔误
- 请问文章中的图是用什么软件画的?
- 还会继续更新吗?最后的两个章节? HOT 1
- 关于第二章JobLogicalPlan的一点小意见 HOT 1
- Please add README.md to markdown/thai HOT 1
- Why the definition of dependencies is different from RDD paper?
- gitbook无法下载
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from sparkinternals.