Sharing a video from my YT channel explaining convolution and visualizing how kernels are learnt… enjoy! submitted by /u/AvvYaa
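For anyone who prefers code to video: a minimal NumPy sketch of the sliding-window operation a convolutional layer applies (note that deep-learning frameworks actually compute cross-correlation, i.e. no kernel flip, which is what this does too; the example image and kernel are illustrative, not taken from the video):

```python
import numpy as np

def conv2d(image, kernel):
    """Valid-mode 2D 'convolution' (cross-correlation, as in DL frameworks):
    slide the kernel over the image and take elementwise-product sums."""
    kh, kw = kernel.shape
    ih, iw = image.shape
    out = np.zeros((ih - kh + 1, iw - kw + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            out[i, j] = np.sum(image[i:i + kh, j:j + kw] * kernel)
    return out

# A Sobel-style horizontal-gradient kernel responds to the vertical edge
# in this toy image (zeros on the left, ones on the right).
image = np.zeros((5, 5))
image[:, 3:] = 1.0
sobel_x = np.array([[-1, 0, 1],
                    [-2, 0, 2],
                    [-1, 0, 1]], dtype=float)
print(conv2d(image, sobel_x))
```

A learned kernel is just such a small weight matrix whose entries are updated by gradient descent instead of being hand-designed like the Sobel filter above.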
I ran a few classification finetuning experiments on relatively "small" models that I found interesting and wanted to share:

| # | Model | Weights | Trainable token | Trainable layers | Context length | CPU/GPU | Training time | Training acc | Validation acc | Test acc |
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | gpt2-small (124M) | pretrained | last | last_block | longest train ex. (120) | V100 | 0.39 min | 96.63% | 97.99% | ... |
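The "last token / last_block" configuration in row 1 can be sketched as the standard PyTorch freezing pattern below. This is a hypothetical toy model (module names like `blocks` and `head` are my assumptions, not the actual gpt2-small layer names): everything is frozen except the final transformer block and a new classification head, and logits are read off the last token's hidden state.

```python
import torch
import torch.nn as nn

class TinyGPTClassifier(nn.Module):
    """Toy GPT-style classifier: embedding, a stack of transformer
    blocks, and a linear classification head on the last token."""
    def __init__(self, vocab=100, d=32, n_blocks=4, n_classes=2):
        super().__init__()
        self.emb = nn.Embedding(vocab, d)
        self.blocks = nn.ModuleList(
            nn.TransformerEncoderLayer(d, nhead=4, batch_first=True)
            for _ in range(n_blocks)
        )
        self.head = nn.Linear(d, n_classes)

    def forward(self, x):
        h = self.emb(x)
        for blk in self.blocks:
            h = blk(h)
        return self.head(h[:, -1, :])  # classify from the LAST token only

model = TinyGPTClassifier()
# Freeze everything, then unfreeze only the last block and the head.
for p in model.parameters():
    p.requires_grad = False
for p in model.blocks[-1].parameters():
    p.requires_grad = True
for p in model.head.parameters():
    p.requires_grad = True
```

Training then proceeds as usual; the optimizer only sees `filter(lambda p: p.requires_grad, model.parameters())`, so gradients flow through but weights update only in the unfrozen parts.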
Brand new paper published in Environmental Modelling & Software. We investigate the possibility of training a model at a data-rich site and reusing it, without retraining or tuning, at a new (data-scarce) site. The paper introduces the concepts of a transferability matrix and transferability indicators. Check out more here: https://www.researchgate.net/publication/380113869_Transfer_learning_in_environmental_data-driven_models_A_study_of_ozone_forecast_in_the_Alpine_region...
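The core idea (fit once at a data-rich site, reuse unchanged at a data-scarce one) can be illustrated with a toy regression. The synthetic "sites" and the RMSE check below are my own stand-ins for illustration, not the paper's transferability matrix or indicator definitions:

```python
import numpy as np

rng = np.random.default_rng(0)

def make_site(n, noise):
    """Synthetic site: an ozone-like target driven by two features,
    with both sites sharing the same underlying relationship."""
    X = rng.normal(size=(n, 2))
    y = 3.0 * X[:, 0] - 1.5 * X[:, 1] + rng.normal(scale=noise, size=n)
    return X, y

X_rich, y_rich = make_site(500, 0.1)  # data-rich training site
X_new, y_new = make_site(20, 0.1)     # data-scarce target site

# Fit once at the rich site (least squares with an intercept column),
# then reuse the weights as-is at the new site -- no retraining.
w, *_ = np.linalg.lstsq(np.c_[X_rich, np.ones(len(X_rich))], y_rich,
                        rcond=None)
pred = np.c_[X_new, np.ones(len(X_new))] @ w

# A simple proxy for transferability: out-of-site RMSE.
rmse = np.sqrt(np.mean((pred - y_new) ** 2))
print(f"zero-retraining RMSE at new site: {rmse:.3f}")
```

If the two sites obey different underlying relationships, this out-of-site error blows up, which is exactly the situation transferability indicators are meant to flag before deployment.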
Open Source Strikes Again! We are thrilled to announce the release of OpenBioLLM-Llama3-70B & 8B. These models outperform industry giants like OpenAI's GPT-4, Google's Gemini, Meditron-70B, Google's Med-PaLM-1, and Med-PaLM-2 in the biomedical domain, setting a new state of the art for models of their size. The most capable openly available medical-domain...
How do I convince my superior to do data preprocessing? Hello, I've been working as an AI Engineer at my current company for a year (I have a master's in CS with a data science specialization). We want to build chatbots specialized in chit-chat (mostly conversational exchanges) in specific languages. The problem is that I don't agree with my superior's approach to...
submitted by /u/blackgreenolive