Unable to run generation tests for Mamba & Jamba models #30828

Open
amyeroberts opened this issue May 15, 2024 · 1 comment · May be fixed by #31094

@amyeroberts (Collaborator)

System Info

  • transformers version: 4.41.0.dev0
  • Platform: Linux-5.15.0-1045-aws-x86_64-with-glibc2.31
  • Python version: 3.10.9
  • Huggingface_hub version: 0.23.0
  • Safetensors version: 0.4.2
  • Accelerate version: 0.29.2
  • Accelerate config: not found
  • PyTorch version (GPU?): 2.2.2+cu121 (True)
  • Tensorflow version (GPU?): not installed (NA)
  • Flax version (CPU?/GPU?/TPU?): 0.7.0 (cpu)
  • Jax version: 0.4.13
  • JaxLib version: 0.4.13
  • Using GPU in script?: No
  • Using distributed or parallel set-up in script?: No

Who can help?

@gante @zucchini-nlp

Information

  • The official example scripts
  • My own modified scripts

Tasks

  • An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
  • My own task or dataset (give details below)

Reproduction

See #30826

test_assisted_decoding_matches_greedy_search_0_random is forcibly skipped for Jamba because it's necessary to unset _supports_cache_class to resolve failing tests on main.
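For context, a quick way to see the flag in question is to inspect it on the installed model classes. This is only an illustrative check (not a fix), and it assumes both model classes are importable from the installed transformers version:

```python
# Illustrative only: _supports_cache_class is a class attribute on
# PreTrainedModel subclasses that signals support for the Cache classes
# used by assisted decoding. Unsetting it (False) is what currently
# forces the skip described above.
from transformers import JambaForCausalLM, MambaForCausalLM

for cls in (JambaForCausalLM, MambaForCausalLM):
    print(cls.__name__, getattr(cls, "_supports_cache_class", None))
```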

test_assisted_decoding_matches_greedy_search_0_random appears to pass for Mamba, but only because all_generative_models is not set in the model tester, so no generative model is actually exercised.
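A hypothetical sketch of the relevant tester pattern is below. The attribute name all_generative_model_classes is my assumption of what "all_generative_models" refers to, and the test class shown is illustrative, not the actual Mamba test file (which also inherits the common ModelTesterMixin / GenerationTesterMixin from tests/):

```python
# Hypothetical sketch, not the real tests/models/mamba test class.
import unittest

from transformers import MambaForCausalLM, MambaModel
from transformers.testing_utils import require_torch


@require_torch
class MambaModelTest(unittest.TestCase):
    all_model_classes = (MambaModel, MambaForCausalLM)
    # If this tuple is missing or empty, generation tests such as
    # test_assisted_decoding_matches_greedy_search have no classes to
    # exercise and "pass" without testing anything.
    all_generative_model_classes = (MambaForCausalLM,)
```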

Expected behavior

Either test_assisted_decoding_matches_greedy_search_0_random can be run for both models with _supports_cache_class unset, or unsetting _supports_cache_class should no longer be necessary for Jamba.
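A hypothetical way to check the first outcome is to run the affected generation test for both models directly; the test file paths below are assumed from the usual transformers repository layout:

```python
# Run the affected generation test for both models (paths are assumptions).
import pytest

pytest.main([
    "tests/models/mamba/test_modeling_mamba.py",
    "tests/models/jamba/test_modeling_jamba.py",
    "-k", "test_assisted_decoding_matches_greedy_search",
])
```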

@zucchini-nlp (Member)

Might be related to #30800

@gante linked a pull request May 28, 2024 that will close this issue