Commit History

All commits by mehdi999 (commit dates not captured).

d09d63c  another test
78e8994  Changed requirements
39d8b01  Corrected pytorch version
72dc299  fix(glA): init default conv_state at first prefill to avoid NoneType unpack
677e61f  fix: remove leftover cache=cache in text_to_speech call
750aa27  fix: let decoder manage its own cache (avoid None conv_state in FLA)
fd1f480  back to basics
6d19e74  runtime: set ZeroGPU duration to 600s
158b7a6  runtime: set ZeroGPU duration to 600s
6d29905  Space: preload CPU thread + cache + logs
92ec5fe  added few things
58c000c  added few things
313c415  added few things
5edbebd  added few things
018aa79  runtime: set ZeroGPU duration to 600s
fc1f9f2  added few things
9f2e2fc  added few things
2dc4aff  added few things
5997b2e  added few things
cf9d6a9  Fix: safe FLA backend + cache guards
831395c  added few things
55fecbf  added few things
a7c605a  added few things
f6f4ba0  added few things
6b8706f  added watch
3d734f0  added watch
7208504  runtime: set ZeroGPU duration to 600s
dd090c8  chore: retrigger build
dbbf6f4  runtime: extend ZeroGPU duration to 600s
46e132f  fix: pass num_heads to CrossAttention ctor
5fd5d30  deps: drop causal-conv1d (needs torch at build-time; not required for demo)
7314545  fix: SwiGLU ctor signature (drop out_features); add causal-conv1d
ca4ede6  fix: expose SimpleGLADecoder.config for registry instantiate_from_config
b90b280  Force FLA mode=chunk to avoid Triton fused kernels on ZeroGPU
50be982  chore: retrigger build
a175cfa  Force FLA mode=chunk to avoid Triton fused kernels on ZeroGPU
bb32e4f  fix(GLA): provide safe past_key_values with default conv_state=(None,None,None)
80ec7ec  fix(GLA): provide safe past_key_values with default conv_state=(None,None,None)
a69b69a  fix(GLA): provide safe past_key_values with default conv_state=(None,None,None)
159df2e  fix: remove leftover cache=cache from text_to_speech call
8caf57f  fix: remove leftover cache=cache from text_to_speech call
d7cfd9b  inference: remove empty cache (GLA expects prefetched conv_state)
ad26fe4  chore: retrigger build
0a0019f  fix: handle VelocityHeadSamplingParams signature; better error logging
1075fc0  fix: align torch/torchaudio to cu124 (CUDA 12.4)
c99b6f0  deps: add flash-linear-attention; bump torch/torchaudio to 2.5.1 and transformers to 4.53.0
b8aa630  fix: replace zcodec import with relative (TransformerBlock)
b650fff  inference-only: drop zcodec deps (relative imports) and avoid training/tokenizer imports at package init
0d64ef9  deps: drop flash-linear-attention to resolve Torch/Transformers conflicts
6a0a374  deps: align with pyproject; torch/torchaudio pinned to cu121-compatible
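
A recurring theme in this history is keeping torch/torchaudio wheels aligned with one CUDA toolkit (cu121 vs. cu124) to avoid dependency conflicts. A minimal requirements.txt sketch of the cu124 pinning described in commits 1075fc0 and c99b6f0 (the exact file layout in this repo is an assumption):

```
# Sketch only: pin torch/torchaudio to CUDA 12.4 wheels, per commits 1075fc0 and c99b6f0.
# The extra index serves PyTorch's cu124 builds; mixing indexes for torch and torchaudio
# is what produces the version conflicts these commits were fixing.
--extra-index-url https://download.pytorch.org/whl/cu124
torch==2.5.1
torchaudio==2.5.1
transformers==4.53.0
```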