
first experiments #49

@ArneBinder

general

All experiments should be run with the multi-model variant of the individual task models, in the following setups:

  • target-model only
  • frozen pre-trained target-model + bert-base-cased
  • frozen pre-trained target-model + frozen other-task-model
  • pre-trained target-model + frozen other-task-model
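To make the four setups above easier to compare, here is a minimal sketch that just enumerates them as data. It is not the project's config format: `Member`, `experiment_setups` and `trainable` are hypothetical names, and the `frozen=False` flags are inferred from the absence of the word "frozen" in the list, which is an assumption.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Member:
    """One model inside a multi-model setup (hypothetical helper type)."""
    name: str     # label taken from the list above
    frozen: bool  # True: weights are not updated during training


def experiment_setups() -> List[Tuple[Member, ...]]:
    # Assumption: a member that is not called "frozen" in the list is trainable.
    return [
        (Member("target-model", frozen=False),),
        (Member("pre-trained target-model", frozen=True),
         Member("bert-base-cased", frozen=False)),
        (Member("pre-trained target-model", frozen=True),
         Member("other-task-model", frozen=True)),
        (Member("pre-trained target-model", frozen=False),
         Member("other-task-model", frozen=True)),
    ]


def trainable(setup: Tuple[Member, ...]) -> List[str]:
    """Names of the members whose weights would be updated."""
    return [m.name for m in setup if not m.frozen]


for setup in experiment_setups():
    names = " + ".join(m.name for m in setup)
    print(f"{names} -> trainable: {trainable(setup)}")
```

Note that in the third setup no member model is trainable, so presumably only whatever sits on top of the combined models (e.g. the task head) would be updated there; that reading is an assumption as well.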

Hyperparameters:

  • learning rate
  • training time: start with a large number of epochs (50?) and possibly use early stopping (see the patience parameter)
  • warming: for later (seems interesting, but requires effort)
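For the "many epochs plus early stopping" idea, a rough sketch of how patience interacts with the epoch budget. This is plain Python and not tied to any trainer. `early_stopping_epoch` is a hypothetical helper, and it assumes that a higher validation score is better and that only strict improvements reset the patience counter.

```python
from typing import Iterable, Tuple


def early_stopping_epoch(
    val_scores: Iterable[float], patience: int, max_epochs: int = 50
) -> Tuple[int, int]:
    """Return (best_epoch, stopped_epoch), both 1-indexed.

    Training stops once `patience` epochs in a row have passed without
    a strict improvement, or when `max_epochs` is reached.
    """
    best_score = float("-inf")
    best_epoch = 0
    stopped_epoch = 0
    for epoch, score in enumerate(val_scores, start=1):
        if epoch > max_epochs:
            break
        stopped_epoch = epoch
        if score > best_score:
            # Strict improvement: remember it and reset the patience window.
            best_score, best_epoch = score, epoch
        elif epoch - best_epoch >= patience:
            # No improvement for `patience` consecutive epochs: stop here.
            break
    return best_epoch, stopped_epoch


# Made-up validation scores, purely for illustration.
scores = [0.50, 0.60, 0.62, 0.61, 0.62, 0.60, 0.59]
print(early_stopping_epoch(scores, patience=3))
```

With a generous epoch budget like 50, the patience value is what effectively decides the training time, so it is probably worth logging the stopping epoch alongside the learning rate.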

Co-ref

@tanikina @ArneBinder

EDIT:

NER

@harbecke

  • target-model only
  • frozen pre-trained target-model + bert-base-cased
  • frozen pre-trained target-model + frozen other-task-model
  • pre-trained target-model + frozen other-task-model

PR that adds the NER configs and updates the experiment log: #71

RE

@leonhardhennig

Sanity Checks

extractive QA

@StalVars

  • target-model only
  • frozen pre-trained target-model + bert-base-cased
  • frozen pre-trained target-model + frozen other-task-model
  • pre-trained target-model + frozen other-task-model

Weights & Biases project for EQA: https://wandb.ai/stalvars/dataset+squadv2-task+extractive_question_answering-training
