An operating system project of mine (that importantly relies on another coexisting operating system to run) sometimes can lose access to logging when critical system errors occur. An example situation might be that the coexisting operating system crashes while interacting with mine, and in those cases I can't get logs out before the processor resets.
In most cases, I can consume logs from the machine using serial ports or a circular buffer (accessed via an external DMA device), but I've encountered a unique situation where the coexisting operating system now commandeers the serial port and runs a hypervisor which prevents me from getting either serial port logs or DMA access before the device resets.
Would anybody have any suggestions as for how I could store the diagnostic data I collect about crashes at a location that would survive reboots incurred from something like a triple fault?
I would also like for this data, ideally, to persist between both warm and cold resets, wherein the machine may lose power briefly before coming back online.
So far I've explored:
- the idea of sticking these diagnostic structures at high points in memory (our example system has 32GB of RAM, after inspecting the memory map provided to the OS from UEFI I put the structure in available system memory at around the 31GB mark), but this memory was cleared after rebooting into a UEFI shell (no memory test was performed per the boot settings)
- saving the UEFI runtime functions for Get/SetVariable, but each of our diagnostic structures are ~2KB in length and we have one for each of the 32 processors on test machine; which is too much memory to store in those NVRAM variables + we don't have those regions mapped into our OS paging tables anyway
- using the disk, which I believe could become complicated depending upon which OS coexists with mine, along with what hardware is used
- using the TPM, but it doesn't seem to have enough space available for use, and may not be present on most hardware
- using SPI flash memory
- using ACPI NVS memory
- using on-board controller memory, which I heard was possible but seems like a bad idea
If anybody has ideas I would love to workshop some of them.
Thank you all for your time
