Spatial perception, in which objects’ motion and positional relationship are recognized, is necessary for applications such as a walking robot and an autonomous car. One of the demanding features of spatial perception in real world applications is robustness. Neural network-based approaches, in which perception results are obtained by voting among a large number of neuronal activities, seem to be promising. We focused on a neural network model for motion stereo vision proposed by Kawakami et al. In this model, local motion in each small region of the visual field, which comprises optical flow, is detected by hierarchical neural network. Implementation of this model into a VLSI is required for real-time operation with low power consumption. In this study, we reduced the computational complexity of this model and showed cell responses of the reduced model by numerical simulation.