Dapper, a Large-Scale Distributed Systems Tracing Infrastructure を読んだ

論文

Dapper, a Large-Scale Distributed Systems Tracing Infrastructureを読んだ時のメモ。どんなもの？分散トレーシングシステム分散システムのtraceをする先行研究とくらべて何がすごい？設計上の目標オーバーヘッドが少ないサービスのパフォーマンス…

2017-07-28

In Search of an Understandable Consensus Algorithm(Extended Version) を読んだ

論文

[In Search of an Understandable Consensus Algorithm(Extended Version)(https://raft.github.io/raft.pdf)を読んだ時のメモ。どんなもの？わかりやすさを重視して開発された合意アルゴリズム Paxosよりもわかりやすいが、Paxosよりも効率的先行研究と…

2017-07-26

Paxos Made Live - An Engineering Perspective を読んだ

論文

Paxos Made Live - An Engineering Perspectiveを読んだ時のメモ。どんなもの？ Paxosを実際のプロダクト(Chubby)で使用するために行った挑戦とその際に選択したアルゴリズムについて Paxosは論文には1ページの擬似コードで説明されているが、実プロダクト…

2017-07-25

Paxos Made Simple 読んだ

論文

Paxos Made Simpleを読んだ時のメモ。どんなもの？ Paxos 分散合意アルゴリズム複数のプロセスが値を提案した時に、どのように1つの値を選ぶかについて技術や手法の肝は？ The Problem safety requirements 提案された値のみが選ばれる 1つの値のみが選ば…

2017-07-13

Kafka: a Distributed Messaging System for Log Processing を読んだ

論文

Kafka: a Distributed Messaging System for Log Processingを読んだ時のメモ。どんなもの？ LinkedInによって開発された分散メッセージングシステム大容量のログを高スループットで配信、低レイテンシで収集することを目的としている先行研究とくらべて…

2017-07-11

CRUSH: Controlled, Scalable, Decentralized Placement of Replicated Data を読んだ

論文

CRUSH: Controlled, Scalable, Decentralized Placement of Replicated Dataを読んだ時のメモ。どんなもの？擬似ランダムデータ分散アルゴリズムデータの名前に対して偏りが無いようにデータノードを割り当てる central allocatorがいなくても新しいデー…

2017-07-10

Session Guarantees for Weakly Consistent Replicated Data を読んだ

論文

Session Guarantees for Weakly Consistent Replicated Data を読んだ時のメモ。どんなもの？ weak consistencyのread-any, write-anyの特性を活かしつつも、ある1つのクライアントからは一貫性があるように見えるようにしたものモバイル端末のユーザはrea…

2017-07-05

分散システム原理とパラダイムの同期について読んだ

読書

Time, Clocks, and the Ordering of Events in a Distributed System を読んでいたが、ぼんやりとしか分からなかったので、まず下記の同期の章を読んだ。分散システム―原理とパラダイム作者: アンドリュー・S.タネンバウム,マールテン・ファンスティーン,An…

2017-07-04

Cassandra - A Decentralized Structured Storage System を読んだ

論文

Cassandra - A Decentralized Structured Storage Systemを読んだ時のメモ。どんなもの？分散ストレージシステム大規模データを多数の一般的なサーバに分散させることで可用性を高めて、SPOFを無くすソフトウェア側で可用性とスケーラビリティをコントロ…

2017-07-03

MapReduce: Simplified Data Processing on Large Clusters を読んだ

論文

MapReduce: Simplified Data Processing on Large Clustersを読んだ時のメモ。どんなもの？巨大なデータセットを処理するプログラミングモデル Map key/valueのinputを中間的なkey/valueペアにする Reduce 全ての中間的なvalueを中間的なkeyでまとめる多…

2017-07-01

The Dataflow Model を読んだ

論文

The Dataflow Model: A Practical Approach to Balancing Correctness, Latency, and Cost in Massive-Scale, Unbounded, Out-of-Order Data Processingを読んだどんなもの？ unbounded で順不同なデータを処理する上で、正確性、latency、costをいい感じに…

ikemonn's blog

技術ネタをちょこちょこと

2017-07-01から1ヶ月間の記事一覧