https://fun-wiki.win/index.php/Building_Stability_in_the_Age_of_Entropy:_Solving_Format_Drift_in_LLM_Pipelines
Measuring AI search is notoriously difficult because answers are non-deterministic and vary substantially by geographic location. Standard trackers fail