Optional. Caption of the audio to be sent, 0-1024 characters after entities parsing
type: String
String