Whether to force the computation of gradients for the input embeddings during training. Useful for LORA.