Processing Large Sessions with Automatic Chunking

CaliAli v1.4 introduces automatic session chunking to handle individual imaging sessions that exceed available system memory. This approach eliminates the need for manual file preparation and provides transparent processing of large datasets.

Implementation

To process a large imaging session, set the batch_sz parameter (the number of frames per chunk). CaliAli then divides the session into chunks that fit in memory:

CaliAli_options.motion_correction.batch_sz = 'auto';        % Estimate from available RAM
CaliAli_options.inter_session_alignment.batch_sz = 'auto';  % Mirrors the estimate

When 'auto' is used, CaliAli samples system memory (falling back to 18 GB if unavailable) and derives a chunk size suited for 300×300 px frames. You can still override the value with an explicit frame count.
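As a minimal illustration of the manual override, the snippet below sets an explicit frame count in place of 'auto'. The value 2000 is only a placeholder chosen for this example, not a recommendation, and frames_per_chunk is a hypothetical helper variable. The same number is assigned to both steps so that motion correction and alignment use the same chunk size.

```matlab
% Placeholder value for illustration only; choose a frame count that fits your RAM.
frames_per_chunk = 2000;

% Replace 'auto' with the explicit frame count in both steps.
CaliAli_options.motion_correction.batch_sz = frames_per_chunk;
CaliAli_options.inter_session_alignment.batch_sz = frames_per_chunk;
```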

Behind the scenes, CaliAli keeps motion-correction templates consistent across chunks, aligns the chunks within each session, and stitches the projections back together. Outputs are almost identical to processing the full session at once; the only differences come from small numerical rounding.

Parameter Selection

Select batch_sz based on available system memory and video dimensions. For 512×512 pixel videos:

| System RAM | `'auto'` estimate | Manual override guidance |
|---|---|---|
| 8 GB | ≈ 900 frames | Stay ≤ 1000 if you see swapping |
| 16 GB | ≈ 1700 frames | 1500–2500 works well |
| 32 GB | ≈ 3300 frames | 3000–5000 for faster runs |
| 64 GB+ | ≥ 6500 frames | Increase gradually if monitoring memory |

Setting batch_sz = 0 disables chunking, matching legacy behaviour.

Note: CaliAli_demo_parameters now leaves batch_sz at 'auto'. Set a numeric value if the heuristic overshoots for your hardware.
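When picking a manual override, a rough footprint estimate can help. The sketch below computes the raw size of one chunk held in memory, assuming frames are stored as single-precision values (4 bytes per pixel). That data type is an assumption made for this example, and the memory actually needed during processing is likely larger because of intermediate copies.

```matlab
% Rough estimate of the raw memory held by one chunk.
frame_height    = 512;    % pixels
frame_width     = 512;    % pixels
bytes_per_pixel = 4;      % assumption: single-precision storage
batch_sz        = 1700;   % frames per chunk

chunk_bytes = batch_sz * frame_height * frame_width * bytes_per_pixel;
chunk_gib   = chunk_bytes / 2^30;

fprintf('One chunk of %d frames holds about %.2f GiB of raw data.\n', batch_sz, chunk_gib);
```

For the values shown, this comes to roughly 1.66 GiB per chunk. Halving batch_sz halves the estimate, which makes it easy to scale a manual value up or down.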

Processing Workflow

The standard CaliAli processing workflow remains unchanged:

CaliAli_options = CaliAli_demo_parameters();

% Enable chunking for large sessions
CaliAli_options.motion_correction.batch_sz = 'auto';
CaliAli_options.inter_session_alignment.batch_sz = 'auto';

% Standard pipeline execution
CaliAli_options = CaliAli_motion_correction(CaliAli_options);
CaliAli_options = CaliAli_align_sessions(CaliAli_options);
CaliAli_cnmfe();

Technical Implementation

When chunking is enabled:

Session Analysis: File dimensions are analyzed to determine optimal chunk boundaries
Chunk Processing: Each chunk is processed independently while maintaining session context
Result Integration: Chunk outputs are combined into session-level results, with per-session frame counts validated before writing
Output Generation: Final outputs match those from complete session processing
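The chunk-boundary and frame-count validation steps above can be pictured with a short sketch. This is a generic illustration, not CaliAli's internal code; the variable names and the example session length are hypothetical.

```matlab
% Split a session of n_frames into consecutive chunks of at most batch_sz frames.
n_frames = 4500;   % hypothetical session length
batch_sz = 2000;   % frames per chunk

if batch_sz == 0
    batch_sz = n_frames;   % 0 disables chunking: one chunk spans the whole session
end

chunk_start = (1:batch_sz:n_frames)';
chunk_stop  = min(chunk_start + batch_sz - 1, n_frames);
chunks      = [chunk_start, chunk_stop];   % one row per chunk: [first_frame, last_frame]

% Validate that the chunks cover every frame exactly once.
frames_covered = sum(chunk_stop - chunk_start + 1);
assert(frames_covered == n_frames, 'Chunk frame counts do not add up to the session length.');

disp(chunks);
```

With these example values the session splits into three chunks, the last one shorter than the others.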

The chunking system handles:
- Motion correction template maintenance across chunks
- Spatial alignment consistency within sessions
- Correct combination of projections across temporal segments
- Memory-mapped file operations for efficient chunk writing
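To illustrate why projections can be combined across temporal segments, the sketch below accumulates a maximum and a mean projection chunk by chunk on synthetic data and compares them with the full-session result. It is a generic demonstration under assumed projection types (max and mean), not CaliAli's implementation; the array sizes and variable names are arbitrary.

```matlab
% Synthetic video: height x width x frames.
rng(0);
Y        = rand(32, 32, 450, 'single');
n_frames = size(Y, 3);
batch_sz = 200;

max_proj = -inf(32, 32, 'single');   % running maximum projection
sum_proj = zeros(32, 32);            % running sum in double precision

for s = 1:batch_sz:n_frames
    e     = min(s + batch_sz - 1, n_frames);
    chunk = Y(:, :, s:e);
    max_proj = max(max_proj, max(chunk, [], 3));      % max of per-chunk maxima
    sum_proj = sum_proj + sum(double(chunk), 3);      % sums add across chunks
end
mean_proj = sum_proj / n_frames;     % divide by the total frame count, not per chunk

% Compare with projections computed on the whole session at once.
full_max  = max(Y, [], 3);
full_mean = mean(double(Y), 3);
max_diff  = max(abs(mean_proj(:) - full_mean(:)));
fprintf('Max projection identical: %d, largest mean difference: %g\n', ...
    isequal(max_proj, full_max), max_diff);
```

The maximum projection matches exactly, while the mean can differ by a tiny amount because the summation order changes. This is the kind of small rounding difference to expect between chunked and unchunked runs.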

Migration from Previous Versions

For users of CaliAli v1.3 and earlier:

If you previously used manual file splitting approaches (e.g., CaliAli_divide_videos() or manual session division), you can now:

Remove any manual file splitting steps from your workflow
Use original, unsplit session files as input
Add the batch_sz parameter to enable automatic chunking
Retain all other processing parameters and workflow steps

The automatic chunking system replaces previous manual approaches and provides equivalent results with simplified workflow management.

Performance Considerations

Memory Usage: Monitor system memory during initial processing to verify the chunk size selection. Reduce batch_sz if you see out-of-memory errors or heavy swapping.
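One way to check headroom before choosing a chunk size is to query available physical memory from MATLAB. The sketch below uses MATLAB's built-in memory function, which is only available on Windows; on other platforms it simply falls back to the 18 GB default mentioned earlier. This is an illustrative helper with hypothetical variable names, not the code CaliAli runs for 'auto'.

```matlab
% Query available physical memory (Windows only); otherwise use a fixed fallback.
if ispc
    [~, sys_view] = memory;   % second output describes system-wide memory
    avail_gb = sys_view.PhysicalMemory.Available / 2^30;
else
    avail_gb = 18;            % assumption: reuse the documented fallback value
end

fprintf('Available memory estimate: %.1f GB\n', avail_gb);
```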

Processing Speed: Chunk size affects processing efficiency. Very small chunks may increase overhead, while very large chunks approach memory limits.

Consistency: Use identical batch_sz values across motion correction and alignment steps for consistent processing.

Troubleshooting

Out-of-memory errors: Reduce batch_sz (try 1000–1500 frames for severely memory-limited systems)

Slower than expected processing: Consider increasing batch_sz if system memory permits

Disable chunking: Set batch_sz = 0 to process entire sessions without chunking

Notes

Chunking operates at the individual session level, not across multiple sessions
Results are equivalent to processing complete sessions, apart from small numerical rounding differences
No changes to downstream analysis workflows are required
Compatible with all existing CaliAli processing modules