论文包括分布式文件系统,分布式数据库和分布式调度。随着分布式系统的发展,文件系统和数据库的分野越来越模糊,为了便于融合思考,不再分类整理文章。 重工程,轻理论,是国内很多程序员,甚至是一些senior的程序员最终走不下去的原因,也是国内缺乏科学家级别程序员的原因。
Achieving Rapid Response Times in Large Online Services
Building Large-Scale Internet Services
Some Potential Areas for Future Research
Designs, Lessons and Advice from Building Large Distributed Systems
Software Engineering Advice from Building Large-Scale Distributed Systems
The Rise of Cloud Computing Systems
Building Software Systems at Google and Lessons Learned
Bigtable: A Distributed Storage System for Structured Data
The Chubby lock service for loosely-coupled distributed systems
Online, Asynchronous Schema Change in F1
F1: A Distributed SQL Database That Scales
F1 Query: Declarative Querying at Scale
F1 - The Fault-Tolerant Distributed RDBMS Supporting Google's Ad Business
Spanner: Google's Globally-Distributed Database
Spanner: Becoming a SQL System
Spanner, TrueTime and the CAP Theorem
Large-scale Incremental Processing. Using Distributed Transactions and Notifications
Pregel: A System for Large-Scale Graph Processing
Megastore: Providing scalable, highly available storage for interactive services
Dremel: Interactive Analysis of Web-Scale Datasets
Amazon Aurora: Design Considerations for High Throughput Cloud-Native Relational Databases
Amazon Aurora: On Avoiding Distributed Consensus for I/Os, Commits, and Membership Changes
Borg, Omega, and Kubernetes
Firmament: Fast, Centralized Cluster Scheduling at Scale
Goods: Organizing Google's Datasets
Mesa: Geo-Replicated, Near Real-Time, Scalable Data. Warehousing
Storage Architecture and Challenges
GFS: Evolution on Fast-forward
The Google File System
Crash Consistency
Leases: An Efficient Fault-Tolerant Mechanism for Distributed File Cache Consistency
Weak Consistency: A Generalized Theory and Optimistic Implementations for Distributed Transactions
Paxos Made Live
Tuning paxos for high-throughput with batching and pipelining
Using Paxos to Build a Scalable, Consistent, and Highly Available Datastore
Logical Physical Clocks and Consistent Snapshots in Globally Distributed Databases
High-Availability at Massive Scale: Building Google’s Data Infrastructure for Ads
Eventually Consistent
A one size fits all database doesn't fit anyone
Calvin: Fast Distributed Transactions for Partitioned Database Systems
Data Consistency Properties and the Trade-offs in Commercial Cloud Storages: the Consumers’ Perspective
From Research to Practice: Experiences Engineering a Production Metadata Database for a Scale Out File System
Consistency Tradeoffs in Modern Distributed Database System Design
PacificA: Replication in Log-Based Distributed Storage Systems
HDFS scalability: the limits to growth
Consistency Rationing in the Cloud:Pay only when it matters
Making Geo-Replicated Systems Fast as Possible,Consistent when Necessary
Monkey: Optimal Navigable Key-Value Store
No compromises: distributed transactions with consistency, availability, and performance
Highly Available Transactions: Virtues and Limitations
High Performance Transactions in Deuteronomy
High-Performance Transaction Processing in SAP HANA
Chain Replication for Supporting High Throughput and Availability
Dynamo: Amazon’s Highly Available Key-value Store
PNUTS: Yahoo!’s Hosted Data Serving Platform
Hekaton: SQL Server’s Memory-Optimized OLTP Engine
Transactional storage for geo-replicated systems
Eventual Consistency Today: Limitations, Extensions, and Beyond(slides)
Eventual Consistency Today: Limitations, Extensions, and Beyond
Rethinking Cost and Performance of Database Systems
High Performance Transactions via Early Write Visibility
In-Memory Performance for Big Data
Incremental Consistency Guarantees for Replicated Objects
Enhanced Paxos Commit for Transactions on DHTs
Windows Azure Storage: A Highly Available Cloud Storage Service with Strong Consistency
一种高效的广域网络锁互斥机制
OctopusFS: A Distributed File System with Tiered Storage Management
OctopusFS in Action: Tiered Storage Management for Data Intensive Computing
The Hadoop Distributed File System
PVFS: A Parallel Virtual File System for Linux Clusters
Data-intensive file systems for Internet services: A rose by any other name ...
To BLOB or Not To BLOB: Large Object Storage in a Database or a Filesystem?
基于天河2高速互连网络实现混合层次文件系统H2FS高速通信
蓝鲸分布式文件系统的分布式分层资源管理模型
大容量、高性能、高扩展能力的蓝鲸分布式文件系统
Ceph: A Scalable, High-Performance Distributed File System
面向大数据分析的分布式文件系统关键技术
数字有机体文件系统的文件写机制
海量小文件存储文件系统研究综述
大数据管理:概念、技术与挑战
文件系统的发展脉络
云计算环境下的分布存储关键技术
Architecture and Dependability of Large-Scale Internet Services
腾讯云大规模任务调度系统的架构蜕变
Designing Cluster Schedulers for Internet-Scale Services
Cluster Scheduling for Data Centers
Learned Index Structures
AresDB: Uber’s GPU-Powered Open Source, Real-time Analytics Engine
The Case for Learned Index Structures
SageDB: A Learned Database System
Serverless Computing: One Step Forward, Two Steps Back
Betrayal, Distrust, and Rationality: Smart Counter-Collusion Contracts for Verifiable Cloud Computing