
Big Data
kiwi小白
Reading Notes: "Big Data: Principles and Best Practices of Scalable Realtime Data Systems"
Chapter 1: A New Paradigm for Big Data. 1.1 How this book is structured: focus on the principles of big data problems => theory / illustration. 1.2 Scaling with a traditional database: original proble… (Original · 2015-10-22 20:23:47 · 3556 views · 2 comments)
ZooKeeper Resources
ZooKeeper enables coordination for distributed systems. This is similar to multithreaded programming, but shared-nothing. Coordination is easier with a component that provides a shared store, like ZooKeeper. ZooKeeper manages data… (Original · 2015-10-14 23:56:33 · 2195 views · 0 comments)
"Hadoop: The Definitive Guide" Reading Notes -- Chapter 3: The Hadoop Distributed Filesystem
Chapter 3: The Hadoop Distributed Filesystem. Designed for: 1. very large files; 2. streaming data access (write once, read many times); 3. commodity hardware. Not a good fit for: 1. low-latency data acce… (Original · 2015-10-08 19:25:40 · 2047 views · 0 comments)
"Hadoop: The Definitive Guide" Reading Notes -- Chapter 2: MapReduce
Chapter 2: MapReduce. Analyzing the data with Unix tools. Problems: 1. Dividing the work into equal-size pieces isn't always easy or obvious. 2. Combining the results from independent processes may require f… (Original · 2015-10-08 19:02:59 · 1871 views · 0 comments)
"Hadoop: The Definitive Guide" Reading Notes -- Chapter 1: Meet Hadoop
Preface: Stripped to its core, the tools that Hadoop provides for working with big data are simple. If there is a common theme, it is about raising the level of abstraction -- to create building bloc… (Original · 2015-10-01 16:56:48 · 2027 views · 0 comments)