Dick, SaschaSaschaDickHerre, JürgenJürgenHerreDelgado, PabloPabloDelgado2024-07-292024-07-292024-04-03https://publica.fraunhofer.de/handle/publica/472239An apparatus (100) according to an embodiment is provided. The apparatus comprises an input interface (110) for receiving a plurality of audio objects of an audio sound scene. Moreover, the apparatus (100) comprises a processor (120). Each of the plurality of audio objects represents a sound source being different from any other sound source being represented by any other audio object of the plurality of audio objects; or at least two of the plurality of audio objects represent a same sound source at different locations. The processor (120) is configured to obtain information on a perceptual difference between two audio objects of the plurality of audio objects depending on a distance metric, wherein the distance metric represents perceptual differences in spatial properties of the audio sound scene. And/or, the processor (120) is configured to process the plurality of audio objects to obtain a plurality of audio object clusters or a plurality of processed audio objects depending on the distance metric.enApparatus and method employing a perception-based distance metric for spatial audioVorrichtung und Verfahren mit einer wahrnehmungsbasierten Distanzmetrik für räumliches AudiopatentEP4346235 A1EP20220198848