According to the received view in the philosophical literature on pictorial perception, when perceiving an object in a picture, we perceive both the picture’s surface and the depicted object, but the surface is only unconsciously represented. Furthermore, it is suggested, such unconscious representation does not need attention. This poses a crucial problem, as empirical research on visual attention shows that there can hardly be any visual representation, conscious or unconscious, without attention. Secondly, according to such a received view, when looking at a picture aesthetically, one both consciously represents and visually attends to both the depicted object and the picture’s surface simultaneously. Thus, contra the empirical research on attention, only conscious visual representations are coupled, by such current view, with attention. And this clearly poses a second problem, as this philosophical account is not in tune with what vision science tells us about the functioning of our visual system. Furthermore, this raises another crucial problem, namely, that of explaining why aesthetic experience of pictures does not feel odd or conflicting, since, as previously noted in the philosophical literature, and contra the received view, if we are simultaneously consciously perceiving both the picture’s surface and the depicted object, there seems to be two things, at the same time, in the foreground of one’s visual consciousness. But, if so, as suggested, this would lead to a conflicting spatial visual experience. This paper offers a new description of the role of visual attention in picture perception, which explains the difference between the usual and the aesthetic way of perceiving a depicted object, without facing the problems reported above. A crucial role in our new account is played by the notion of unconscious attention, the distinction between focal and distributed, as well as top-down and bottom-up visual attention and the relationship between visual attention and visual consciousness. The paper, thus, offers the first theory concerning the exercise of visual attention in pictorial perception that is both philosophically rigorous and empirically reliable.