Question about rank-based gene ordering in Geneformer

#509

by zs1524 - opened 12 days ago

12 days ago

I believe that the gene ranking values are contextually important and should be appropriately reflected in the model.
Since Geneformer is based on a BERT-style Transformer, which processes tokens in parallel without explicit positional encoding,
I'm curious whether the model can truly recognize and utilize this rank-based importance solely from the input order during pretraining.
Specifically, has there been any investigation or ablation study comparing ranked versus non-ranked gene ordering during pretraining?

zs1524 changed discussion status to closed 12 days ago

zs1524 changed discussion status to open 12 days ago

ctheodoris

Owner 12 days ago

Thanks for your question. Yes, the in silico overexpression studies are based on shifting the overexpressed gene to the front of the rank value encoding, so the rank does impact the model embeddings in the model that is pretrained with ranking.

ctheodoris changed discussion status to closed 12 days ago

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

Your need to confirm your account before you can post a new comment.

· Sign up or log in to comment