-
Notifications
You must be signed in to change notification settings - Fork 3.4k
Comparing changes
Open a pull request
base repository: NVIDIA-NeMo/NeMo
base: main
head repository: intercom/NeMo
compare: main
- 20 commits
- 46 files changed
- 5 contributors
Commits on Sep 25, 2025
-
feat: Add Qwen3-Next hybrid attention architecture support
Implement complete Qwen3-Next architecture integration with NeMo Megatron framework: - Add hybrid attention mechanism supporting both full and linear attention layers - Implement gated delta rule for efficient linear attention with O(n) complexity - Create modular layer specifications with dynamic attention type selection - Add custom transformer layer supporting mixed attention mechanisms - Integrate with existing NeMo model infrastructure via megatron_gpt_model.py Key features: - Linear attention with chunk-based and recurrent processing algorithms - L2 normalization and clipping mechanisms for numerical stability - Full tensor parallelism support for distributed training - Memory-efficient processing for long sequences (4K+ tokens) - Configurable layer type patterns for optimal performance/efficiency balance Architecture allows 3B total parameters with ~1B active during inference, providing substantial efficiency gains while maintaining model expressivity. Files added: - nemo/collections/nlp/models/language_modeling/megatron/qwen3_next/__init__.py - nemo/collections/nlp/models/language_modeling/megatron/qwen3_next/qwen3_next_modules.py - nemo/collections/nlp/models/language_modeling/megatron/qwen3_next/qwen3_next_spec.py - nemo/collections/nlp/models/language_modeling/megatron/qwen3_next/qwen3_next_layer.py - nemo/collections/nlp/models/language_modeling/megatron/qwen3_next/README.md Files modified: - nemo/collections/nlp/models/language_modeling/megatron_gpt_model.py
Intercom Engineering committedSep 25, 2025 Configuration menu - View commit details
-
Copy full SHA for 6846fc2 - Browse repository at this point
Copy the full SHA 6846fc2View commit details -
feat: Add FineWeb dataset preprocessing and streaming support
Add comprehensive FineWeb dataset integration for NeMo pre-training: - Create preprocessing script to convert HuggingFace FineWeb to NeMo binary format - Implement streaming dataset support for large-scale web data processing - Add optimized data loading with configurable batch sizes and workers - Support for subset selection (sample-10BT, sample-350BT) for experimentation - Integrate Qwen2 tokenizer for consistent text processing Key features: - Efficient streaming from HuggingFace datasets hub - Automatic text filtering and length validation - Parallel processing with configurable worker count - Memory-efficient JSONL intermediate format - Compatible with NeMo's standard indexed dataset pipeline Files added: - scripts/nlp_language_modeling/preprocess_fineweb_for_qwen3_next.py - nemo/collections/nlp/data/language_modeling/megatron/hf_streaming_dataset.py - nemo/collections/nlp/data/language_modeling/megatron/gpt_fineweb_dataset.py Enables training on web-scale datasets with proper tokenization and formatting for foundation model pre-training workflows.
Intercom Engineering committedSep 25, 2025 Configuration menu - View commit details
-
Copy full SHA for 10b6e04 - Browse repository at this point
Copy the full SHA 10b6e04View commit details -
feat: Add complete Qwen3-Next training configuration and documentation
Add production-ready training setup for 3B parameter Qwen3-Next model: - Complete training configuration for 8-GPU setup with optimal parallelism - Hybrid attention pattern optimized for 3B total/1B active parameters - BFloat16 mixed precision and gradient accumulation for memory efficiency - Integration with preprocessed FineWeb dataset using NeMo binary format - Comprehensive documentation with step-by-step training guide Training configuration features: - Tensor parallelism across 2 GPUs per model replica - Sequence parallel processing for 4K sequence lengths - Activation checkpointing for memory optimization - Cosine annealing scheduler with appropriate warmup - Validation and checkpointing with best model selection Documentation includes: - Complete preprocessing pipeline instructions - Training command examples and parameter explanations - Performance tuning guidelines for different GPU configurations - Troubleshooting section for common training issues - Architecture details and scaling recommendations Files added: - qwen3_next_3b_fineweb_training.yaml - QWEN3_NEXT_TRAINING_GUIDE.md Ready for production training of hybrid attention language models on web-scale data.
Configuration menu - View commit details
-
Copy full SHA for 0e2bc0a - Browse repository at this point
Copy the full SHA 0e2bc0aView commit details
Commits on Sep 30, 2025
-
Bump transformers from 4.51.3 to 4.53.0 in /requirements (#1)
Automatic merge of Dependabot PR
Configuration menu - View commit details
-
Copy full SHA for c4fae31 - Browse repository at this point
Copy the full SHA c4fae31View commit details
Commits on Oct 1, 2025
-
Bump vllm from 0.8.5.post1 to 0.10.1.1 in /requirements (#2)
Automatic merge of Dependabot PR
Configuration menu - View commit details
-
Copy full SHA for 470f9c4 - Browse repository at this point
Copy the full SHA 470f9c4View commit details -
Bump vite from 6.3.5 to 6.3.6 in /examples/voice_agent/client (#3)
Automatic merge of Dependabot PR
Configuration menu - View commit details
-
Copy full SHA for 6c69d36 - Browse repository at this point
Copy the full SHA 6c69d36View commit details
Commits on Oct 8, 2025
-
Bump vllm from 0.10.1.1 to 0.11.0 in /requirements (#4)
Automatic merge of Dependabot PR
Configuration menu - View commit details
-
Copy full SHA for a2153d8 - Browse repository at this point
Copy the full SHA a2153d8View commit details
Commits on Oct 21, 2025
-
Bump vite from 6.3.6 to 6.4.1 in /examples/voice_agent/client (#5)
Automatic merge of Dependabot PR
Configuration menu - View commit details
-
Copy full SHA for 54c2323 - Browse repository at this point
Copy the full SHA 54c2323View commit details
Commits on Mar 3, 2026
-
Bump rollup from 4.43.0 to 4.59.0 in /examples/voice_agent/client (#6)
Automatic merge of Dependabot PR
Configuration menu - View commit details
-
Copy full SHA for 314d361 - Browse repository at this point
Copy the full SHA 314d361View commit details
Commits on Mar 26, 2026
-
Bump picomatch from 4.0.2 to 4.0.4 in /examples/voice_agent/client (#7)
Automatic merge of Dependabot PR
Configuration menu - View commit details
-
Copy full SHA for ff60acd - Browse repository at this point
Copy the full SHA ff60acdView commit details
Commits on Apr 8, 2026
-
Bump vite from 6.4.1 to 6.4.2 in /examples/voice_agent/client (#8)
Automatic merge of Dependabot PR
Configuration menu - View commit details
-
Copy full SHA for d963cae - Browse repository at this point
Copy the full SHA d963caeView commit details
Commits on Apr 15, 2026
-
Bump transformers from 4.53.0 to 5.0.0rc3 in /requirements (#9)
Automatic merge of Dependabot PR
Configuration menu - View commit details
-
Copy full SHA for 36ea214 - Browse repository at this point
Copy the full SHA 36ea214View commit details
Commits on Apr 21, 2026
-
Configuration menu - View commit details
-
Copy full SHA for ba88b6a - Browse repository at this point
Copy the full SHA ba88b6aView commit details
Commits on Jun 3, 2026
-
Disable GitHub Actions workflows (#15)
Move .github/workflows/*.yml(.yaml) to .github/workflows-disabled/ to neutralize them ahead of the org-wide GitHub Actions enablement. Restore via the re-enable-workflow process when a workflow is intentionally re-enabled. Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
Configuration menu - View commit details
-
Copy full SHA for a1c444b - Browse repository at this point
Copy the full SHA a1c444bView commit details
Commits on Jun 4, 2026
-
Configuration menu - View commit details
-
Copy full SHA for e26ede8 - Browse repository at this point
Copy the full SHA e26ede8View commit details
Commits on Jun 10, 2026
-
Bump vllm from 0.11.0 to 0.20.0 in /requirements (#11)
Automatic merge of Dependabot PR
Configuration menu - View commit details
-
Copy full SHA for 501bbde - Browse repository at this point
Copy the full SHA 501bbdeView commit details -
Bump postcss from 8.5.6 to 8.5.15 in /examples/voice_agent/client (#16)
Automatic merge of Dependabot PR
Configuration menu - View commit details
-
Copy full SHA for f2b5559 - Browse repository at this point
Copy the full SHA f2b5559View commit details
Commits on Jun 16, 2026
-
Bump esbuild, @vitejs/plugin-react-swc and vite (#18)
Automatic merge of Dependabot PR
Configuration menu - View commit details
-
Copy full SHA for 5e6f605 - Browse repository at this point
Copy the full SHA 5e6f605View commit details -
Bump vllm from 0.20.0 to 0.22.0 in /requirements (#17)
Automatic merge of Dependabot PR
Configuration menu - View commit details
-
Copy full SHA for 2427db1 - Browse repository at this point
Copy the full SHA 2427db1View commit details
Commits on Jun 17, 2026
-
Configuration menu - View commit details
-
Copy full SHA for 2da1aac - Browse repository at this point
Copy the full SHA 2da1aacView commit details
This comparison is taking too long to generate.
Unfortunately it looks like we can’t render this comparison for you right now. It might be too big, or there might be something weird with your repository.
You can try running this command locally to see the comparison on your machine:
git diff main...main