683 followers 149 článků/týdně
[D] Why does nproc_per_node not work for values greater than 1?

Context: Running a training for dinov2 using torchrun. I have two nodes. When I run training (1 gpu per node) w/nproc=1, it works. When I allocate 2 gpus per node, I change nproc to 2. The training then crashes when trying to initialize the model. Any insight on what this could be? submitted by /u/dillpill4 [link] [comments]

Fri May 10, 2024 21:43
[D] Seeking Insights on Time Series Data Augmentation: Python Libraries and Benchmark Datasets

Hey everyone, I'm diving into the world of time series data augmentation and I'm curious about the current state of the art techniques, particularly those that are accessible through Python libraries. Techniques: What are some of the most effective methods for augmenting time series data? Are there any recent advancements or innovative approaches worth...

Fri May 10, 2024 21:43
[P] Google Colab crashes before even training my images dataset.

I have 780 images. All of them are microscopic and I'm doing microplastic image detection. First I did binary classification using U-Net and then VGG-16 transfer learning. Google Colab didn't crash one bit. Worked really well. Now I'm doing multi-class segmentation and pre-processing is kinda same. except for one extra channel for colored masks. But,...

Fri May 10, 2024 21:43
[D] Is Evaluating LLM Performance on Domain-Specific QA Sufficient for a Top-Tier Conference Submission?

Hello, Hello, I'm preparing a paper for a top-tier conference and am grappling with what qualifies as a significant contribution. My research involves comparing the performance of at least five LLMs on a domain-specific question-answering task. For confidentiality, I won't specify the domain. I created a new dataset from Wikipedia, as no suitable dataset...

Fri May 10, 2024 18:43
[D] How are decesion boundry drawn in feature space?

I'm trying to understand how ann vs cnn works. Essentially network is just leaning a mapping function from input to output. But in context of ANN where feature space is represented by data as a dot in N dims feature space. The boundries are non linear and drawn which sperates the feature space. But w.r.t CNN, what is high dimensional space and feature...

Fri May 10, 2024 18:43
[N] Book Lauching: Accelerate Model Training with PyTorch 2.X

Hello everyone! My name is Maicon Melo Alves and I'm a High Performance Computing (HPC) system analyst specialized in AI workloads. I would like to announce that my book "Accelerate Model Training with PyTorch 2.X: Build more accurate models by boosting the model training process" was recently launched by Packt. This book is for intermediate-level data...

Fri May 10, 2024 18:43

Vytvořte si vlastní zdroj

Jste připraveni to vyzkoušet?
Spusťte 14denní zkušební verzi bez nutnosti platební karty.

Vytvořit účet