Is this broken or out of date? Lot's of errors trying to get this to work.

by Fieldsweeper - opened Feb 26

Feb 26

I am seeing errors related to kernel size pattern in the bezzam encoder—those strided convolutions with kernels [4, 4, 8, 10, 10, 16] and corresponding downsampling ratios. Among other things actually.

Not sure what could be affecting it, other than perhaps some overlap of your references to the Microsoft official one, (also having this model removed)

bezzam

Owner Feb 27

@Fieldsweeper this is a draft checkpoint to get a version working with Transformers. so it's normal that it isn't working as the code to use it (here) is still in progress / under review. I can let you know when it's ready for testing if you want to try out before it gets merged into Transformers?

SerialVelocity

15 days ago

Should I be using the latest version of the code in that PR? It looks like progress has stalled.

bezzam

Owner 15 days ago

hi @SerialVelocity , thanks for your message. yes that PR is still waiting for a review... we've had lots of releases in between but it is still in the plan to merge it into Transformers.

This draft checkpoint and the 1.5B draft should normally work with the latest code from the PR. Let me know if it doesn't!

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment