Dataflow: ratio of requestors to partitions
Hi everyone,
I am running a dataflow sourced by a report definition, with a defined partition key.
Let's assume the partition key can have x possible distinct values, and the dataflow will run on y nodes.
My questions are the following:
1) In the options configuration panel, before launching the dataflow run, what number of requestors should I set to maximise throughput?
2) Is it correct to assume that (number of requestors) * (number of nodes) must be equal to or slightly greater than x, the number of possible distinct values of the partition key?
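To make question 2 concrete, here is a small sketch of the arithmetic I have in mind. It only restates my assumption and does not confirm that it is correct. The function name and parameters are my own, not part of any Pega API, and I am assuming the requestor count is set per node.

```python
def requestors_per_node(distinct_partition_values: int, nodes: int) -> int:
    """Smallest per-node requestor count r such that r * nodes >= distinct_partition_values.

    Hypothetical helper: it encodes the assumption from question 2,
    not documented platform behaviour.
    """
    if distinct_partition_values < 1 or nodes < 1:
        raise ValueError("both arguments must be positive integers")
    # Integer ceiling division: ceil(x / y) without floating point.
    return -(-distinct_partition_values // nodes)


if __name__ == "__main__":
    x, y = 10, 4  # example: 10 distinct partition key values, 4 nodes
    r = requestors_per_node(x, y)
    print(f"requestors per node: {r}, total requestors: {r * y}")
```

With x = 10 and y = 4 this gives 3 requestors per node, so 12 in total, which is the "equal to or slightly greater than x" case I am asking about.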
Thanks to anyone who can help me with this doubt.