I think there are three possibilities, in my order of preference:
1. Transcribe all text, omitting (with TNs simply saying "Picture omitted") the graphics, so that the reader knows where they are. Experience leads me to believe that print music does not translate well in tactiles.
2. Omit the graphics and the related text, assuming that the text descriptions are not useful without the pictures.