-
Notifications
You must be signed in to change notification settings - Fork 731
Pull requests: pytorch/FBGEMM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Update jagged_acc_weights_and_counts and jagged_slice_cpu bench
cla signed
fb-exported
meta-exported
#5620
opened Apr 11, 2026 by
q10
Contributor
Loading…
Remove stale ROCm 5.7 skip checks and dead SM70 code in tests
cla signed
module: rocm
#5619
opened Apr 11, 2026 by
cyyever
Contributor
Loading…
Simplify is_torchdynamo_compiling to direct import from torch.compiler
cla signed
#5618
opened Apr 11, 2026 by
cyyever
Contributor
Loading…
Cleanup stale code for ROCM < 6.2 and CUDA < 12
cla signed
module: rocm
#5616
opened Apr 11, 2026 by
cyyever
Contributor
Loading…
Add aligned_unique_ptr RAII wrapper to avoid leak risks (#5609)
cla signed
fb-exported
meta-exported
#5615
opened Apr 11, 2026 by
q10
Contributor
Loading…
Add CUDA 13.2 support to CI and release workflows (#5610)
cla signed
fb-exported
meta-exported
#5610
opened Apr 10, 2026 by
gchalump
Contributor
Loading…
Port batched_dense_vec_jagged_2d_mul and jagged_1d_to_truncated_values to tritonbench
cla signed
fb-exported
meta-exported
#5603
opened Apr 9, 2026 by
q10
Contributor
Loading…
Replace rocm-smi with amd-smi across ROCm build, CI, and docs
cla signed
module: rocm
#5597
opened Apr 8, 2026 by
adam360x
Loading…
3 tasks done
bf16 scale/bias for INT4
cla signed
fb-exported
meta-exported
#5595
opened Apr 8, 2026 by
jeetkanjani7
Loading…
Enable more clang-tidy checks on C++20 (#5575)
cla signed
fb-exported
meta-exported
module: rocm
#5588
opened Apr 7, 2026 by
q10
Contributor
Loading…
Add gflag to select feature names for SSD KV embedding table
cla signed
fb-exported
meta-exported
#5585
opened Apr 7, 2026 by
jnwan
Loading…
Split RowWiseSparseAdagradFused.cc.stripped.o from fbcode//admarket/adfinder:adfinder
cla signed
fb-exported
meta-exported
#5578
opened Apr 6, 2026 by
meta-codesync
bot
Loading…
Fix TBE v2 forward kernel for embedding dim > 1024 (#5326)
cla signed
#5569
opened Apr 2, 2026 by
cyyever
Contributor
Loading…
Port expand_into_jagged_permute benchmark to tritonbench
cla signed
fb-exported
meta-exported
#5566
opened Apr 1, 2026 by
q10
Contributor
Loading…
Fix bash scripts to fail correctly for ROCm jobs (#5564)
ciflow/rocm-mi300
cla signed
fb-exported
meta-exported
module: rocm
#5564
opened Mar 31, 2026 by
q10
Contributor
Loading…
Add AMD/ROCm support for SSD TBE inference
cla signed
fb-exported
meta-exported
module: rocm
#5561
opened Mar 31, 2026 by
goldcoderZ
Contributor
Loading…
Add TurboSSDInferenceModule for HSTU serving integration
cla signed
fb-exported
meta-exported
#5560
opened Mar 31, 2026 by
goldcoderZ
Contributor
Loading…
2D weights support for permute_1D_data_kernel_vec
cla signed
fb-exported
meta-exported
#5557
opened Mar 31, 2026 by
kausv
Contributor
Loading…
Add failure logging and alerting for SSD offloading (#5542)
cla signed
fb-exported
meta-exported
#5542
opened Mar 26, 2026 by
Frederick-Zhu
Loading…
Fix DramKV race: hold rlock during inplace update writes
cla signed
#5536
opened Mar 26, 2026 by
cyyever
Contributor
Loading…
Add tests for group_index_select_dim0 mixed-dtype validation
cla signed
fb-exported
meta-exported
#5507
opened Mar 21, 2026 by
q10
Contributor
Loading…
Fix Half2 UVM performance regression with vectorized store
cla signed
fb-exported
meta-exported
module: rocm
#5499
opened Mar 19, 2026 by
q10
Contributor
Loading…
Use atomicAdd for lxu_cache_locking_counter increments/decrements
cla signed
fb-exported
meta-exported
#5479
opened Mar 16, 2026 by
goldcoderZ
Contributor
Loading…
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.