-
-
Notifications
You must be signed in to change notification settings - Fork 1.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Messages stop being delivered to a consumer [2.10.24] #6290
Comments
There are two issues here: 1. Why consumer do not get the messges? For that we need a bit more info - first of all - are you sure that the consumer in code matches the name with the consumer from info? Maybe it would be useful to have consumer info called from withing the code to be sure. 2. Tight loop
That means that if there are no messages for given consumer, it will close the Instead, you could use However, probably the best way is to use messages() which will handle looping over batches for you in optimized way. |
Thanks, will take a look.
Where is this mentioned and is this the same case when using |
well in |
Ok, I am going to try with |
I tested with batch and I see the same issue. For example, I don't see waiting pulls when that issue happens. This is one of the consumer
|
It's normal on Could you provide |
@Jarema : I am a colleague of @niks3089. I just deployed some code on our end to debug this. Will let you know of the outcome when we observe this again. async fn log_consumer_info(consumer: &Mutex<PullConsumer>, name: &str) {
let mut consumer = consumer.lock().await;
let previous_pending = consumer.cached_info().num_pending;
let info = consumer.info().await;
match info {
Ok(info) => {
// we consider a consumer stuck if it has pending messages for two consecutive periods
let is_stuck = previous_pending > 0 && info.num_pending > 0;
if is_stuck {
statsd_count!("execution.nats.stuck_consumer_detected", 1);
error!("consumer info for {}: {:?}", name, info);
}
}
Err(e) => {
error!("getting consumer info for {} failed with {}", name, e);
}
}
} |
In the natscli from main there are a couple of commands that would be good to try to check the state of your consumers:
|
could you share the stream info as well? |
Observed behavior
I have multiple consumers connecting to multiple leaf nodes across multiple regions and I am having issues with some consumers which don't process the messages even though they're available and messages are available to process.
Basically, the consumer stalls and doesn't make any progress.
The code to fetch the data is like this
So when I log the above code,
messages.next()
doesn't return any messages and it keeps running in busy loop. This is a severe problem. Is it a known issue? Am I missing something? Any help is appreciated. Thanks!Note: I ack the message in handle_project_message and the same code works for other consumers and random consumer gets stuck
Expected behavior
The consumers like other similar consumers connected to the same cluster and node processes the messages and I don't know what triggers this issue
Server and client version
nats-server: 2.10.24
Host environment
No response
Steps to reproduce
No response
The text was updated successfully, but these errors were encountered: