-
Notifications
You must be signed in to change notification settings - Fork 2.9k
Pull requests: microsoft/onnxruntime
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
free staging buffer early
ep:WebGPU
ort-web webgpu provider
#22943
opened Nov 26, 2024 by
guschmue
Loading…
[Test only] BFloat16 test for SkipSimplifiedLayerNormalization
#22941
opened Nov 25, 2024 by
jiafatom
Loading…
[WebNN] Improve the util function of creating WebNN constant MLOperand
#22935
opened Nov 25, 2024 by
Honry
Loading…
Implementation of flash attention for native webgpu ep
#22932
opened Nov 24, 2024 by
sushraja-msft
Loading…
3 tasks done
Bump onnx from 1.16.1 to 1.17.0 in /onnxruntime/python/tools/transformers/models/phi2
dependencies
Pull requests that update a dependency file
python
Pull requests that update Python code
#22928
opened Nov 22, 2024 by
dependabot
bot
Loading…
[TensorRT EP] Use TRT/CUDA/ORT version from runtime instead of build time to generate hash value
#22921
opened Nov 21, 2024 by
chilo-ms
Loading…
[js/webgpu] support FlashAttention-2 for attention operator
ep:WebGPU
ort-web webgpu provider
#22915
opened Nov 21, 2024 by
xhcao
Loading…
[QNN EP] [DRAFT] Support Conv float weight/bias.
#22906
opened Nov 20, 2024 by
adrianlizarraga
•
Draft
[js/webgpu] Enable graph capture with memcpy and fix duplicated dispatch
#22883
opened Nov 19, 2024 by
axinging
Loading…
Refactor emulator start and stop functions for clarity and efficiency
platform:mobile
issues related to ONNX Runtime mobile; typically submitted using template
#22861
opened Nov 16, 2024 by
jchen351
Loading…
Keep the model metadata on the generated EP context model (use bridge api)
#22860
opened Nov 15, 2024 by
chilo-ms
Loading…
[TensorRT EP] Fix wrong input order when generating IndexedSubGraph
#22857
opened Nov 15, 2024 by
chilo-ms
Loading…
Previous Next
ProTip!
Follow long discussions with comments:>50.