Filter by tags
An internal cognitive engine for quantitative root cause analysis using Bayesian inference. Helps weigh competing hypotheses, prevent anchoring bias, and determine the most efficient diagnostic steps through probabilistic reasoning.
A project-aware DSP engineer that executes a 4-phase architectural replacement plan for audio processing. Enforces H-Chip 14-pole topology with int32 fixed-point constraints, migrating legacy float-based systems to modern C++ implementations while maintaining strict phase-order compliance.
Implementation guide for X-Trend (Cross Attentive Time-Series Trend Network), combining LSTM encoders, multi-head attention mechanisms, and few-shot learning for trend-following trading strategies with interpretable predictions.
A deterministic effects engine that applies chronicle events to world state entities. Handles battle outcomes, treaties, agent deaths, and other events to modify entity attributes like force strength, relationships, and agent status in simulation.
Systematically verify CVlization inference examples are complete, properly structured, and functional. Validates build process, inference execution, output generation, host filesystem persistence, and model caching.
Fine-tune Speech-to-Text models like OpenAI Whisper using Unsloth's optimized LoRA pipeline, achieving 1.5x faster training with 50% less memory usage for specialized terminology, accents, and dialects.
Performs full fine-tuning (FFT) with 100% exact weight updates using Unsloth's optimized gradient checkpointing, enabling larger batch sizes and complete model modification for base model pre-training and continued pre-training tasks.
Add comprehensive type hints to Python functions and methods with special support for PyTorch tensor types. Improves code maintainability, enables static type checking with mypy, and provides better IDE support for scientific computing code.
Production-ready PyTorch model serving engine that handles MAR packaging, custom handlers for preprocessing/inference, multi-GPU worker scaling, and model version management via REST/gRPC API.
Optimize PyTorch models using torch.compile (TorchDynamo/Inductor) for JIT compilation into optimized kernels. Focuses on reducing Python overhead, managing compile overhead, debugging graph breaks, and proper benchmarking methodology with warmup runs.
Research and recommend optimal tech stacks for rapid MVP development. Analyzes project requirements and recommends programming languages, frameworks, databases, AI models, APIs, and deployment platforms, prioritizing developer experience, free tiers, and battle-tested solutions.
Comprehensive guide for integrating Asleep sleep tracking SDK into Android applications. Covers MVVM architecture, permission handling, state management, Jetpack Compose UI, Kotlin coroutines, foreground services, and real-time sleep data processing with complete code examples.
Deep code and system analysis using Quantum Cognitive OS with multi-dimensional reasoning. Analyzes complex codebases, architectures, security, and performance through semantic, relational, causality, and temporal lenses with persistent learning.
A high-level training framework for PyTorch that automates 40+ engineering details like epoch loops, optimization, and hardware acceleration while maintaining flexibility. Supports multi-GPU/multi-node scaling, reproducibility, and decouples research code from engineering boilerplate.
A library built on PyTorch for implementing Graph Neural Networks (GNNs). Provides MessagePassing layers, modular aggregation schemes, and efficient mini-batching for handling large graphs through disjoint graph representation.
Configure and verify PyTorch CUDA 13 environment including toolkit, driver requirements, and wheel compatibility. Provides guidance on CUDA setup verification, runtime checks, and accurate GPU timing for NVIDIA GPUs.
Expert guidance for Python Optimization Modeling Objects (Pyomo), covering linear/nonlinear programming, mixed-integer optimization, dynamic systems with differential equations, constraint programming, and solver integration with GLPK, IPOPT, Gurobi, and CPLEX.
Enterprise AI platform for building, customizing, and deploying generative AI models and agents. Includes NeMo Retriever for RAG pipelines, NeMo Customizer for fine-tuning, NeMo Guardrails for safety, and tools for data curation, evaluation, and multi-agent orchestration.
Universal functions for vectorized element-wise operations on NumPy arrays, supporting broadcasting, type casting, reductions, accumulate operations, and custom function wrapping for high-performance mathematical computations.
A rigorous methodology that requires constructing an explicit problem model before implementation. Enforces strict separation between modeling and coding phases to reduce hallucinations and ensure correctness in complex logic, state machines, and constraint systems.
Automates creation of MobX State Tree stores with TypeScript integration, following established patterns for domain models, collection stores, and root store integration in mobile applications.
Enterprise performance optimization orchestrator with Context7 integration, Scalene AI profiling, intelligent bottleneck detection, automated optimization strategies, and predictive performance tuning across 25+ programming languages for comprehensive application performance enhancement.
Practical workflows and context engineering strategies for MoAI-ADK, including JIT retrieval, Explore agent usage, SPEC-TDD-Sync execution patterns, and debugging solutions for efficient AI-assisted development.
Execute mechanistic interpretability experiments from JSON specifications. Supports family sweeps, heatmaps, itemset mining, minimal cores, pairwise interactions, and validation experiments. Routes experiment specs to appropriate runners and produces structured JSON results with diagnostics.
Analyze SAE (Sparse Autoencoder) decoder weights to understand feature output influence, importance, and similarity. Reveals what outputs a feature promotes or suppresses, providing complementary insights to activation analysis for interpretable AI.
Converts inefficient Python loops into fast vectorized PyTorch tensor operations, achieving 10-10000x performance improvements by leveraging GPU acceleration and batch processing.
Guides implementation of Higher Inductive Types (HITs) in the ComputationalPaths Lean 4 library, including axiom declarations, recursion principles, fundamental group calculations, and encode-decode proofs for topological spaces.
A systematic guide for implementing research papers step-by-step from scratch. Focuses on building deep understanding through methodical implementation with checkpoint questions, debugging strategies, and verification steps for each component.
Run Python and ML code on cloud GPUs seamlessly. Prefix commands with 'gpu' to execute on remote GPUs via RunPod. Supports model training, ComfyUI, Stable Diffusion, and LLM inference with automatic provisioning, code syncing, and output management.
Execute complete feature development lifecycle using multi-model AI orchestration. Covers research, architecture design, prototyping, testing, security review, and deployment with 12-stage workflow using Gemini, Claude, and Codex models.