Mobile music listening can be traced back to the introduction of Sony Walkman that upgraded music players both with privacy and portability. With the mobile listening media, our daily journeys in public environment have become more privatized, aestheticized and contented. It is a challenge to perform such a private activity in public environment with many people and audio-visual stimuli around. The journeys with music become additionally challenging with the music players' interfaces confined into tiny buttons and screens that heavily rely on visual modality. This paper discusses if the aesthetic experience of mobile listening can be enhanced through the way users control mobile listening media. It presents the analysis of a tri-partite fieldwork to demonstrate how the use of physicality in interaction can contribute to the beauty of our interactions with mobile listening media while dealing with the challenges in the (use) context. This study points out that context comprises inspiration as well as challenge. The results indicate that controlling mobile listening media becomes sensorially enriched and meaningful when physical context is interpreted as a means of interaction.