Systems, methods and computer-readable media are provided for capturing audio that is memorable. Buffered audio is stored in a first device over a recent interval such as 5 minutes. In the absence of an indication a circular buffer is used that continually replaces old audio. When a user hears something that he wants to capture, he manipulates a control on the first device that results in buffered audio being transferred to another storage area. An audio file is automatically created. A second device can receive the created audio file, and it is capable of annotating the audio with a more descriptive title, an appropriate photo, etc. The second device is also able to provide a control interface for the capture application in the first device.