Does anyone have advice for people without a CS background breaking into the ML industry? Ive been doing undergrad ML research for about 3 years now (from a cogsci perspective) but I find the MLOps/coding/implementation of it difficult and confusing, (especially when classes and objects are introduced). I have no problem copying and pasting different...
If both RoPE and ALiBi work under the assumption that we should assign increasingly lower scores the further apart two tokens are, wouldn't the score be so penalized at some point that even if there is an interesting fact 1 million tokens away, we couldn't retrieve it because the positional encoding would force it to have such a low score? submitted...
Hello all, I am a uni student for a masters in AI. During my bachelors I did my thesis at a company and the lead AI had a PhD in Evolutionary algo's. I had a guest lecture from a lead DS last week from a multi billion dollar online marketplace and he also has a PhD. these are a few examples of Leads with PhDs that I've seen. So this poses the question,...
My Graphic card: RTX 2060 OS: Windows 11 The AI program I want to use tells me to install version 11.8 of Cuda, but I recommend using version 12,4 of my graphics card After installing version 11.8 of Cuda, I downloaded cuDNN v8.9.7, for CUDA 11.x and copied the files inside into each matching folder in the Cuda installation folder. I checked the installation...
I usually only see them being used for inference. Is there a reason that they can be used for inference but not training? submitted by /u/notEVOLVED [link] [comments]
Hi everyone, I've created a sequential model in tensorflow to predict handwritten digits with an accuracy of 0.96. On the dataset provided by tf it performed very well, but if i try to predict on my own handwritten digits it always outputs 8. I have converted the photo to grayscale, resized it to 28 x 28, converted to an array, and reshaped it. Any...