pyFTSIncrementalIndexer not running or turning off automatically

Support Doc

MaryCarbonara

Member since 2010

216 posts

Posted: Sep 27, 2022

Last activity: Sep 23, 2024

Posted: 27 Sep 2022 10:50 EDT
Last activity: 23 Sep 2024 12:33 EDT

pyFTSIncrementalIndexer not running or turning off automatically

Applies to Pega Platform™ versions 8.4.6, 8.5.6, 8.6.3, 8.6.4, 8.6.5, 8.7.x, 8.8.x, and 23.x

Symptoms
Explanation
Solution
Planned enhancement
Related content

Symptoms

You are using embedded search in your environment which is now deprecated in the latest releases, where Pega nodes are hosting Elasticsearch indexes and managing them and experience the following:

Searches for newly created work fail.
pyFTSIncrementalIndexer is not running or is turning off automatically.

Because these symptoms are caused by other conditions in your environment, you need to analyze the specifics of your environment to determine precisely what led to the reported behavior.

Here are some example scenarios.

Scenario 1: Index Status is RED or YELLOW

If the status of your indexes is RED, or in some cases YELLOW, the pyFTSIncrementalIndexer will automatically turn off.

How to gauge the health of Elasticsearch clusters and shards describes how to determine if this is the case.

Scenario 2: Stream Issue

If the Stream Service is not running in your environment or is having issues, this can cause the pyFTSIncrementalIndexer to not run. Navigate to the Queue Processor landing page in Admin Studio to check if your other Queue Processors are running and verify if there is a banner at the top of that page stating that the Stream Service is unavailable.

For more information on how to troubleshoot Stream service issues, refer to: Troubleshooting the Stream service

https://support.pega.com/question/error-stream-service-not-running-clus…

Scenario 3: Elasticsearch Failed to Initialize

With embedded search, all nodes must initialize Elasticsearch as part of startup. If Elasticsearch fails to initialize on an index host node, this can cause the pyFTSIncrementalIndexer to either stop on its own or in some rare cases cause it to report processing hundreds of thousands of records, but not drop the Ready to Process count over time.

You can identify if this has occurred by one of the following ways:

Checking PDC to see if an OPS0010 has been reported, which will include the node it occurred on.
Checking the pyIndexerState column in the data_schema.pr_sys_statusnodes table for a node reporting FAILED.
Generating a current system state for your cluster and reviewing the SearchState.json file. This includes a section about the index host nodes and will also state whether the pyIndexerState has FAILED or not.

In these cases, we have typically relied on a restart of the affected index host node to recover, however we have created a package for clients still using embedded search on Pega versions 8.5.6+ that should allow your Pega application to self heal if this is encountered by restarting Elasticsearch on nodes where it failed to initialize as expected.

Solution:

The vast majority of issues we see with this can be resolved by downloading the ReinitFailedNodes.zip RAP file attached to this article and importing the file from Dev Studio using the default import options. The RAP file has both schema changes and rules that will need to be imported and applied.

This will add the Job Scheduler ReinitFailedNodes to your environment which will start running every hour on Search and BackgroundProcessing nodes and attempt to reinitialize Elasticsearch on those nodes if it has failed.

Given that Elasticsearch has already failed on a node where this will execute, this should have no negative impact to your environment. If this does not resolve the problem, look for errors during the last startup related to search not initializing as there may be something else going on that requires further investigation.

Using full-text search based in Elasticsearch

Checking search index status

Manage batch indexing easily (Pega Platform 8.3)

Rebuilding search indexes

Determining which job schedulers are running

Managing queue processors

Tracing a queue processor

Log levels for log categories

This Support Document was prompted by the INC Collection and related client case work cited in Related Support Case Numbers. It is tracked by US-492244.

To see attachments, please log in.

Pega Platform 8.7

Pega Platform

System Administration

Reporting

Troubleshooting

Did you find this content helpful?

Yes

Want to help us improve this content?
Send Feedback

Reply
Likes (1)

Pooja Gadige
Share this page Facebook Twitter LinkedIn Email Copying... Copied!

Posted: 2 years ago

Posted: 14 Nov 2022 9:57 EST

ANKITMITRA

Cognizant Technology Solution

replied to MaryCarbonara

Report

@MaryCarbonara Hello - We are upgrading our application from 8.6.2 to 8.6.5 and face the same issue. We have 6 Nodes in cluster.

3 Nodes - Cutom1 node type

3 Nodes - Search,Backgroundprocessing,Stream node type

We have pr_index set up on all 6 nodes.
So what we have done we have set the DSS (indexing/distributed/expected_search_node_count) as 6. Did a rolling restart.
But still facing the issue.

Any Suggestion on this please.

To see attachments, please log in.

Posted: 2 years ago

Posted: 14 Nov 2022 11:41 EST

MaryCarbonara replied to ANKITMITRA

Report

@ANKITMITRA Thanks for describing your problem scenario, updating from Pega Platform version 8.6.2 to 8.6.5.

@NickLoving_GCS and @szadp Can you help? If a significant update to this Support Document is needed, please submit a FDBK item to TSO KM BL-10060. Thank you!

To see attachments, please log in.

Reply
Likes (1)

Pooja Gadige

Posted: 1 year ago

Posted: 31 Jan 2024 14:29 EST

NickLoving_GCS

PEGA

replied to ANKITMITRA

Report

@ANKITMITRA There are multiple scenarios described here, which does your situation fall into?

To be clear, the expected_search_node_count DSS (information related to this now moved to How to gauge the health of Elasticsearch clusters and shards), by itself is not a resolution to a single a problem. It should help prevent a specific issue from occurring after you're already in a healthy state. I'm not sure what you mean by having pr_index set on 6 nodes, this sounds unrelated to index host nodes. If that you do mean 6 index host nodes, it's worth noting that 6 is likely more than you need. Our recommendation is typically 3.

To see attachments, please log in.

Posted: 10 months ago

Posted: 23 Sep 2024 11:58 EDT

AkshithReddyT

Evoke Technologies

replied to MaryCarbonara

Report

@MaryCarbonara Hello, We are on 8.6.2 and Deployed our Pega Instance into Red Hat Open Shift and Externalized SRS (Search Reporting Services) as part of Containerization. But right now, I can see the Queue processor: pyFTSIncrementalIndexer is not running and the associated Data Flow is stopped. Can see an error for QP: couldn't move to RUNNING Current status: NOT_RUNNING.

Right now we have 2 Web, 2 Batch and 1 Stream Node, and Stream node is up and running.

Didn't found any OPS0010 in PDC
pyIndexerState is null in data_schema.pr_sys_statusnodes for all of our 5 nodes.

Any thoughts or inputs?

To see attachments, please log in.

Posted: 10 months ago

Posted: 23 Sep 2024 12:13 EDT

NickLoving_GCS

PEGA

replied to AkshithReddyT

Report

@AkshithReddyT

SRS does not use the pyFTSIncrementalIndexer, the pySASIncrementalIndexer will be used for SRS instead. The pyFTSIncrementalIndexer can be left off in this case.

To see attachments, please log in.

Posted: 10 months ago

Posted: 23 Sep 2024 12:26 EDT

AkshithReddyT

Evoke Technologies

replied to NickLoving_GCS

Report

@NickLoving_GCS Ok, but Full text search is not working fine, I mean not able to search for some of the rules.

To see attachments, please log in.

Posted: 10 months ago

Posted: 23 Sep 2024 12:33 EDT

NickLoving_GCS

PEGA

replied to AkshithReddyT

Report

@AkshithReddyT

Could be a lot of things causing issues. Are the SRS pods and Elasticsearch server actually started and healthy? Are the SRS pods successfully communicating with the Elasticsearch server? Is Pega successfully communicating with SRS? Did you synchronize and re-index from the search landing page? Do the pySAS* QPs have broken queue items? What does the broken XML say? I would suggest opening an INC with the answers to those questions.

To see attachments, please log in.

Support Doc

pyFTSIncrementalIndexer not running or turning off automatically

Symptoms

Scenario 1: Index Status is RED or YELLOW

Scenario 2: Stream Issue

Scenario 3: Elasticsearch Failed to Initialize

Solution:

Related content

Need help or want to help others?

Experience the benefits of Support Center when you log in.

Support Doc

pyFTSIncrementalIndexer not running or turning off automatically

Symptoms

Scenario 1: Index Status is RED or YELLOW

Scenario 2: Stream Issue

Scenario 3: Elasticsearch Failed to Initialize

Solution:

Related content

Related content:

Need help or want to help others?

Experience the benefits of Support Center when you log in.

We'd prefer it if you saw us at our best.