Evaluating Foundation Models on Timbre Cognition Tasks