The cassandra-stress is a tool for benchmarking the ScyllaDB database. It’s very helpful in evaluating any solution and to assess cluster and data model performance. It can also help with sizing and SLA estimations. The lesson goes over a basic example of how to use it.
Cassandra Stress
it’s a tool that came from Cassandra to generate load on your cluster so you can
ingest data so let’s say you want to have a dataset with 300 million records
and you want to see how it behaves while you ingesting the data use cassandra
stress for that we use cassandra stress extensively if you go to the ScyllaDB
website most of our benchmarks use Cassandra
stress some YCSB yeah this is this is what you used to test a
real use case so because you can use you know basic comments like this one it’s
going to write I think it’s ten billion records with consistence level of one
using sixteen connections five columns of 64 bytes repetition factor of
1 so this is just for write and you could use for reads or you can create
your own personalized workload and you can specify your schema so you can say
you know what I have this columns with this data type
the size of this information could be from X to Y you can get complex very
fast but the good thing is that you can simulate exactly what you want for your
application and then you can experiment with that so let’s say you know I’m not
sure about which compaction strategy to use I’m kind of on the fence you know
what run two workloads with different compaction strategies and everything
else is the same you see which one is more advantage
so user-defined mode is great for simulating this you know real workloads
sometimes Cassandra says has some limitations that you cannot get past in
this case you have to create your own code do your own hack but usually it
solves most of the problems then you can test different types of workloads as I
mentioned consistency level replication factor compaction strategies maybe are
trying a new feature from ScyllaDB maybe you want to test workload prioritization
maybe you want to test incremental compaction strategy go for it and again use
it in a controlled environment don’t go don’t there’s no reason
no good reason I can come up for running. Cassandra stress on a production
cluster this is for simulating stuff you don’t want to use that in production so
please documentation right there there’s a gazillion options as usual