Join us at ScyllaDB University Live, instructor-led, online, hands-on, training sessions | September 24
Register now

Filtering and Denormalizing Data

7 min to complete

This lesson talks about Filtering and Denormalization of data.
In this section we’re going to introduce a data modeling problem, present different solutions to it, and comment on each, showing pros and cons.
Let’s rewind about our last session: Previously we did a simple modeling of a tweets and timeline query, and, at this time, we’ll focus only on the timeline feature.
Our data modeling strategy is to partition the timeline by username, and cluster it by created_at in descending order. This approach ensures that whenever we query the timeline table, we’ll get the most recent tweets added to that particular timeline.
But now, we’d like to enhance our application by adding a timeline feature that filters by liked tweets from a user and retrieves the fifty-first items.
However, we’re facing a limitation due to our current data modeling approach.
The lack of a liked column inside the partition or clustering keys prevents us from executing this query. Attempting to do so will result in an error.
What are the different options for solving this?

 

Transcript
fa-angle-up