Search
Part 5: Deep Dive - Spark Window Functions
This is the final part of the deep dive series on understanding internals of Apache Spark Window Functions. In earlier sections, we...
Dec 2, 20246 min read
84 views
0 comments
Part 4: Deep Dive - Spark Window Functions
In part 3 , we explored the internal workings of AggregateProcessor and how it evaluates expressions using our rank() window function...
Nov 29, 20246 min read
87 views
0 comments
Part 3: Deep Dive - Spark Window Functions
In the previous post, we briefly alluded about AggregateProcessor without going into the depth. In this part, we will focus on its...
Nov 22, 20247 min read
77 views
0 comments
Part 2: Deep Dive - Spark Window Functions
This is part 2 of deep dive series on understanding internals of Apache Spark Window Functions. Following our discussion of Spark's rank()...
Nov 19, 20245 min read
146 views
0 comments
Part 1: Deep Dive - Spark Window Functions
This is part 1 of deep dive series on understanding internals of Apache Spark Window Functions. Introduction A window function in query...
Nov 16, 20243 min read
214 views
0 comments
The Design of Causal Consistent Databases
Today's distributed systems are undoubtedly complex and varied, requiring different data consistency guarantees. Amongst all,...
Jul 18, 20239 min read
532 views
0 comments
Erasure codes for Distributed Storage
Data is exploding exponentially and so is the requirement for massively scalable distributed storage systems. A distributed storage...
Nov 10, 20228 min read
292 views
0 comments
BSON - Diving Deep
Background BSON stands for Binary JSON which is a serialization format for binary-encoding JSON-like documents. BSON was developed at...
Jul 6, 20213 min read
596 views
1 comment
MongoDB ChangeStreams - Full Document Lookup
The change streams by default emits only the delta fields for any update operation, ie. the event with operationType update does not...
Jun 13, 20212 min read
464 views
0 comments
MongoDB ChangeStreams - Resuming
Today we are going to explore a very powerful tool in the change streams called the resume token. For motivation purposes, consider, your...
Jun 10, 20214 min read
1,315 views
0 comments
MongoDB ChangeStreams - Filtering and Reshaping
In Part 1 of the series, we saw how to keep a watch on a collection and receive events. This means, that the subscriber receives all the...
Jun 7, 20212 min read
171 views
0 comments
MongoDB ChangeStreams - Introduction
MongoDB 3.6 introduced a new feature called change streams. The change streams is a very powerful feature which allows applications to...
Jun 5, 20212 min read
186 views
0 comments