
Figure: Overview of our framework, consisting of 4 key components: (1) Trajectory Collection, gathering a small set of human trajectories by recording user actions and state observations at each step; (2) Thought Completion, reconstructing the implicit thought process missing in raw human trajectories; and (3) Trajectory Boost, diversifying action decisions to further enhance trajectory quality; (4) Agent Training, developing a strong computer use agent with remarkable data efficiency.