ASAP: Architecture search, anneal and prune A Noy, N Nayman, T Ridnik, N Zamir, S Doveh, I Friedman, R Giryes, ... International conference on artificial intelligence and statistics, 493-503, 2020 | 118 | 2020 |
Teaching structured vision & language concepts to vision & language models S Doveh, A Arbelle, S Harary, E Schwartz, R Herzig, R Giryes, R Feris, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 70 | 2023 |
Dense and aligned captions (DAC) promote compositional reasoning in VL models S Doveh, A Arbelle, S Harary, R Herzig, D Kim, P Cascante-Bonilla, ... Advances in Neural Information Processing Systems 36, 2024 | 38 | 2024 |
Going beyond nouns with vision & language models using synthetic data P Cascante-Bonilla, K Shehada, JS Smith, S Doveh, D Kim, R Panda, ... Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 38 | 2023 |
StarNet: towards weakly supervised few-shot object detection L Karlinsky, J Shtok, A Alfassy, M Lichtenstein, S Harary, E Schwartz, ... Proceedings of the AAAI Conference on Artificial Intelligence 35 (2), 1743-1753, 2021 | 27* | 2021 |
Detector-free weakly supervised grounding by separation A Arbelle, S Doveh, A Alfassy, J Shtok, G Lev, E Schwartz, H Kuehne, ... Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 26 | 2021 |
DEGAS: differentiable efficient generator search S Doveh, R Giryes Neural Computing and Applications 33, 17173-17184, 2021 | 22 | 2021 |
MAEDAY: MAE for few- and zero-shot AnomalY-Detection E Schwartz, A Arbelle, L Karlinsky, S Harary, F Scheidegger, S Doveh, ... Computer Vision and Image Understanding 241, 103958, 2024 | 21 | 2024 |
MetAdapt: meta-learned task-adaptive architecture for few-shot classification S Doveh, E Schwartz, C Xue, R Feris, A Bronstein, R Giryes, L Karlinsky Pattern Recognition Letters 149, 130-136, 2021 | 21 | 2021 |
Towards multimodal in-context learning for vision & language models S Doveh, S Perek, MJ Mirza, W Lin, A Alfassy, A Arbelle, S Ullman, ... ECCVW 2024, 2024 | 11 | 2024 |
Meta-prompting for automating zero-shot visual recognition with LLMs MJ Mirza, L Karlinsky, W Lin, S Doveh, J Micorek, M Kozinski, H Kuehne, ... European Conference on Computer Vision, 370-387, 2025 | 8 | 2025 |
NumeroLogic: Number Encoding for Enhanced LLMs' Numerical Reasoning E Schwartz, L Choshen, J Shtok, S Doveh, L Karlinsky, A Arbelle arXiv preprint arXiv:2404.00459, 2024 | 6 | 2024 |
Comparison Visual Instruction Tuning W Lin, MJ Mirza, S Doveh, R Feris, R Giryes, S Hochreiter, L Karlinsky arXiv preprint arXiv:2406.09240, 2024 | 2 | 2024 |
ConMe: Rethinking Evaluation of Compositional Reasoning for Modern VLMs I Huang, W Lin, MJ Mirza, JA Hansen, S Doveh, VI Butoi, R Herzig, ... arXiv preprint arXiv:2406.08164, 2024 | 2 | 2024 |
GLOV: Guided large language models as implicit optimizers for vision language models MJ Mirza, M Zhao, Z Mao, S Doveh, W Lin, P Gavrikov, M Dorkenwald, ... arXiv preprint arXiv:2410.06154, 2024 | 1 | 2024 |
Teaching VLMs to Localize Specific Objects from In-context Examples S Doveh, N Shabtay, W Lin, E Schwartz, H Kuehne, R Giryes, R Feris, ... arXiv preprint arXiv:2411.13317, 2024 | | 2024 |
LiveXiv--A Multi-Modal Live Benchmark Based on Arxiv Papers Content N Shabtay, FM Polo, S Doveh, W Lin, MJ Mirza, L Choshen, M Yurochkin, ... arXiv preprint arXiv:2410.10783, 2024 | | 2024 |
Augmenting In-Context-Learning in LLMs via Automatic Data Labeling and Refinement J Shtok, A Alfassy, FA Dahood, E Schwartz, S Doveh, A Arbelle arXiv preprint arXiv:2410.10348, 2024 | | 2024 |
Training visual language grounding models using separation loss A Arbelle, L Karlinsky, S Doveh, J Shtok, A Alfassy US Patent 11,954,144, 2024 | | 2024 |