657 followers 178 件/週
[D] Advice for Non CS Major in ML

Does anyone have advice for people without a CS background breaking into the ML industry? Ive been doing undergrad ML research for about 3 years now (from a cogsci perspective) but I find the MLOps/coding/implementation of it difficult and confusing, (especially when classes and objects are introduced). I have no problem copying and pasting different...

Mon Apr 29, 2024 18:32
[D] How can attention mechanisms retrieve meaningful information over long distances when using RoPE or ALiBi?

If both RoPE and ALiBi work under the assumption that we should assign increasingly lower scores the further apart two tokens are, wouldn't the score be so penalized at some point that even if there is an interesting fact 1 million tokens away, we couldn't retrieve it because the positional encoding would force it to have such a low score? submitted...

Mon Apr 29, 2024 18:32
[D] Do Lead's in an AI/DS/ML team always have PhDs, is it a requirement?

Hello all, I am a uni student for a masters in AI. During my bachelors I did my thesis at a company and the lead AI had a PhD in Evolutionary algo's. I had a guest lecture from a lead DS last week from a multi billion dollar online marketplace and he also has a PhD. these are a few examples of Leads with PhDs that I've seen. So this poses the question,...

Mon Apr 29, 2024 18:32
[D] Why my GPU doesn't work?

My Graphic card: RTX 2060 OS: Windows 11 The AI program I want to use tells me to install version 11.8 of Cuda, but I recommend using version 12,4 of my graphics card After installing version 11.8 of Cuda, I downloaded cuDNN v8.9.7, for CUDA 11.x and copied the files inside into each matching folder in the Cuda installation folder. I checked the installation...

Mon Apr 29, 2024 18:32
[D] Can AI accelerators be used for training?

I usually only see them being used for inference. Is there a reason that they can be used for inference but not training? submitted by /u/notEVOLVED [link] [comments]

Mon Apr 29, 2024 18:32
[D] Sequential model bad at predicting my own handwriting?

Hi everyone, I've created a sequential model in tensorflow to predict handwritten digits with an accuracy of 0.96. On the dataset provided by tf it performed very well, but if i try to predict on my own handwritten digits it always outputs 8. I have converted the photo to grayscale, resized it to 28 x 28, converted to an array, and reshaped it. Any...

Mon Apr 29, 2024 15:33

自分のためのニュースフィードを組み立てよう

準備はよろしいですか?
14 日間のトライアルをはじめましょう。クレジットカードは不要です。

アカウントを作成