Skip to content
GitLab
Explore
Sign in
Primary navigation
Search or go to…
Project
L
Large Small Language Model
Manage
Activity
Members
Labels
Plan
Issues
Issue boards
Milestones
Wiki
Code
Merge requests
Repository
Branches
Commits
Tags
Repository graph
Compare revisions
Snippets
Build
Pipelines
Jobs
Pipeline schedules
Artifacts
Deploy
Releases
Package registry
Model registry
Operate
Environments
Terraform modules
Monitor
Incidents
Analyze
Value stream analytics
Contributor analytics
CI/CD analytics
Repository analytics
Model experiments
Help
Help
Support
GitLab documentation
Compare GitLab plans
Community forum
Contribute to GitLab
Provide feedback
Keyboard shortcuts
?
Snippets
Groups
Projects
Show more breadcrumbs
Malik Algelly
Large Small Language Model
Repository graph
Repository graph
You can move around the graph by using the arrow keys.
main
Select Git revision
Branches
6
adapt_model
evaluation
main
default
protected
model
slides
training
6 results
Begin with the selected commit
Created with Raphaël 2.2.0
19
Dec
18
17
15
12
11
10
2
23
Nov
15
10
9
Add train impl to slides
slides
slides
Update slides title and ToC
Update model diagram
Add slideshow
Add gitignore
main
main
fix(adapt_model): changed batch size
adapt_model
adapt_model
fix(adapt_model): changes to params
fix(adapt_model): changed path
fix(adapt_model): added pos to evaluate
debug: add mask and debug dim prob
Add slidev sample slideshow
fix(adapt_model): fixed issue with pos_indices
feat(adapt_model): new potential model architecture
modif: save model with name of run in it
modif: change sequence size to reduce training time
add + modif: add generate files + modif model because of low perf
debug
add entity wandb
add num_layers
modif traning and add script
Merge branch 'evaluation' of gitlab.unige.ch:Malik.Algelly1/large-small-language-model
merge(training->main)
update evaluation.py
evaluation
evaluation
feat(training): made changes to the model, dataset loading and the training loop. Added code for future validation and masking in model.py
training
training
Revert "Add basic model"
Add basic model
model
model
add: architecture of the project
create: dataset class for tokenization
modif readme file
add initial files
Initial commit
Loading