The packaging of meaning in verbs varies widely across languages since verbs are free to encode different aspects of an event. At the same time, languages tend to display recurrent preferences in lexicalization, e.g. verb-framing vs. satellite-framing in motion. It has been noted, however, that the lexicalization patterns in motion are not carried over to the domain of vision, since gaze trajectory (‘visual path’) is coded outside the main verb even in verb-framed languages. This ‘typological split’ (Matsumoto 2001), however, is not universal. This article contains the first extensive report of verb-framing in the domain of vision based on data from Maniq (Austroasiatic, Thailand). The verbs are investigated using a translation questionnaire and a picture-naming task, which tap into subtle semantic detail. Results suggest the meanings of the verbs are shaped by universal constraints linked to earth-based verticality and bodily mechanics, as well as local factors such as the environment and the cultural scenarios of which looking is a salient part. A broader look across the whole Maniq verb lexicon reveals further cases of verbally encoded spatial notions and demonstrates a pervasive cross-domain systematicity, pointing to the language system itself as an important shaping force in lexicalization.