March 24, 2026 E. Vasilyeva, A. Leonidov, A. Titov Dynamics of TD(1) reinforcement learning methods in light of frequency factor of choice actions