We have a requirement to mask pega blob data for preparation of non-prod environments. We have tried the approach of masking within pega as described, where you end up looping through the blob object structure, however this is incredibly inefficient and takes many days (for a 400gb work table, or maybe <100gb for an active request partition). Is there a more efficient way of achieving this? Blob encryption is not an option because data masking needs to prevent user from seeing content as well as prevent unmasked data from flowing through BIX.
Are there any scripting utilities which can do that (maybe via prpcUtils)?
We did a build a PoC tool which can do data masking for identified properties, like PII related. The primary intent behind the PoC tool was to mask the data when the data is being replicated from higher environments to lower environments.
In your scenario, I read that you are looking for encryption in the same environment, and it is all the data. Have you considered the DB level encryption options ?
Posted: 4 years ago
Posted: 8 Mar 2019 10:08 EST
Eugene Roytfeld (EugeneR7)
Could you provide more information on this PoC tool? It seems like something that covers what I'm looking for.
In my scenario, i'm specifically not looking for encryption, as we need to data on an (any) environment masked. We do full schema backup and restore, so we don't necessarily care whether we mask data on same environment, or replicate with masking to a different environment, as long as the blob content (selective set of properties deemed relevant for masking) is masked.
We have the similar requirement from Mercedes-Benz Financial China. Would like seek for an efficient approach to mask the customer related data columns from database level including blob columns in lower environment. Is there any feasible solution or Poc?