Best for last - only 3 days left, real novel challenge: direct learning from bitstream ðŸ§
How do we extract features from MPEG-G compressed data without fully decompressing it? Access selective parts of the bitstream (e.g., k-mer counts, alignment summaries, read lengths). How to use Genie's indexing tools for partial decoding. One approach is to build a Python Feature Extractor with Genie’s C++/CLI Interface. Another approach, more experimental but very exciting:
- Treat entropy-coded blocks or CABAC features as direct DL input
- Embed compressed representations into contrastive or transformer models
- Inspired by video compression + ViT work!! To make an analogy: Say you are working on vision-related problems - this would be like using DCT blocks from JPEG for training, not the pixels themselves.
Courrage