The auditory output channel is rather under-utilized in smart object to human communication. One reason is that in a smart environment, multiple overlapping audio sources can be disturbing to people. We propose a wearable audio augmentation system which allows people to effortlessly select and switch between sound sources given their interest. Our system leverages visual contact via the head pose as a measure of interest towards a smart object. We demonstrate a prototype implementation in three application scenarios and a preliminary user evaluation.