Solved

Numerous Backup Cache Issues

  • 1 November 2023
  • 1 comment
  • 92 views

Userlevel 3

So I’m am basically a day or two away from taking the time and going through and disabling the backup cache across all of my customers, it causes way too many problems. But thought I’d try asking here as a last resort since Veeam’s support seems useless as usual and my R&D forum posts aren’t getting any responses. 

I have had various cache related issues, especially since the ridiculously buggy V12 update, but here this past week it’s been especially bad. I have about 20 computers, that I know of right now, which are failing to upload the backup cache to the VCC server. As such these computers have not been backing up. At first I thought maybe they just needed some time to catch up or something, but it’s been over a week now and they’re not even modifying the files in the VCC repository. (Side note, yes, I did go look at the files, because it’s useful, even though Veeam says there’s no reason I should ever need to look at the files.)

Seems there’s numerous different errors and behaviors across the affected agents, some show status in the VSPC, some don’t, some show data transferred in the VCC console, some don’t, some show files being modified, some don’t (none show an actual increase in file size, so I don’t know what’s changing, but I’m thinking the file contents aren’t actually being modified), some show errors in the VSPC, some don’t.

My latest one I’ve looked more closely at does the following:

Shows the backup cache sync job running in the VCC console, shows the backup job running in the VSPC, updates (continually) the modified time for the relevant VIB file in the repository, the file size has stayed the same for several days, the VBM file modified time is also being updated. The job in the VCC console shows data being transferred. The job ends after a short time. When not running the VSPC shows a “cache is full” error. Which I’m assuming is why new backups are not going to the cache, and has nothing to do with the cache getting uploaded to the server.

I’ve also had other issues on other agents here at the same time, one which showed no activity in VSPC and no data transferred in the VCC console, though it did show a job running for seconds at a time, I wouldn’t have noticed it except for the bug in the console that causes completed jobs to not get removed from the “Running” view. This job said there was a corrupt file somewhere, in the logs, not in an error message actually reported to anything. Apparently Veeam can’t tell from the error message or the log files where the corrupt file is.

That scenario doesn’t make sense to me though, if the corrupt files in the repository, why did manually disabling the cache (to empty it) fix the problem? If the issue was a corrupt file in the cache, how did a VCC issue cause that? 

I am convinced there was some sort of VCC issue that caused all this, because there’s too may different customers affected at the same time for it to be anything else that I can think of. But no idea what sort of issue could be the problem, other customers are working just fine, it’s just the cache syncing jobs that are broken. 

Most of the issues have been fixed by simply disabling the cache, restarting the agent, and running the job. But I can’t come up with anything that could be caused by the VCC infrastructure having a problem, but could be fixed by the cache being cleared on the agent side.

So, some questions I’d like answered if anyone knows:

Does the backup cache syncing process literally just copy the files from the cache to the repository? Or does it do something funny and unnecessarily complicated like when you do a backup copy job? 

Anyone else seen similar issues with cache syncing that were resolved by disabling the cache?

Anyone have any general ideas? I’m still working with support, but thus far no one’s found what the problem is.

icon

Best answer by BackupBytesTim 8 November 2023, 14:48

View original

1 comment

Userlevel 3

Update for anyone who read this, we have decided the backup cache feature is far too buggy and unreliable and have decided that it is one more aspect of Veeam we shall not be utilizing. 

The issue was, based on my own and Veeam support’s findings, a corrupted file located on the computer being backed up. It appears the computer lost the connection to the VCC server in the middle of a cache sync and this somehow resulted in a corrupted file on the computer which then prevented the cache from syncing, which in turn prevented the backup job from running at all. Unfortunately Veeam was unable to determine what exactly happened. So the solution we were left with was just disable the cache in order to delete the cache contents. It could then be re-enabled and would work (I can corroborate that it was working after that) however we can’t go on using a feature that could break so catastrophically from something as simple and common as a lost connection. So we’ve disabled the backup cache on a large number of computers and will not enable it in the future, we’ll likely be disabling it on the rest of our customers as they have issues arise, which from my experience I can say will happen sooner or later.

I’ll admit that I’ve never considered a backup to the cache as a true “backup” due to relying on the internal disk, however I must say that I am once again disappointed in Veeam for being as unreliable as it is. 

Comment