683 followers 146 článkov/týždeň
[D] Is Evaluating LLM Performance on Domain-Specific QA Sufficient for a Top-Tier Conference Submission?

Hello, Hello, I'm preparing a paper for a top-tier conference and am grappling with what qualifies as a significant contribution. My research involves comparing the performance of at least five LLMs on a domain-specific question-answering task. For confidentiality, I won't specify the domain. I created a new dataset from Wikipedia, as no suitable dataset...

Fri May 10, 2024 18:43
[D] How are decesion boundry drawn in feature space?

I'm trying to understand how ann vs cnn works. Essentially network is just leaning a mapping function from input to output. But in context of ANN where feature space is represented by data as a dot in N dims feature space. The boundries are non linear and drawn which sperates the feature space. But w.r.t CNN, what is high dimensional space and feature...

Fri May 10, 2024 18:43
[N] Book Lauching: Accelerate Model Training with PyTorch 2.X

Hello everyone! My name is Maicon Melo Alves and I'm a High Performance Computing (HPC) system analyst specialized in AI workloads. I would like to announce that my book "Accelerate Model Training with PyTorch 2.X: Build more accurate models by boosting the model training process" was recently launched by Packt. This book is for intermediate-level data...

Fri May 10, 2024 18:43
[D] Best community/website to find ML engineer interested in hourly work

I've been searching for a machine learning engineer on platforms like Upwork, but many of the candidates seem to have limited experience in building models from scratch. They often focus on integrating pre-built ML APIs rather than developing custom models tailored to specific requirements. Where is the best place to find ML engineers that can handle...

Fri May 10, 2024 15:42
[D] What on earth is "discretization" step in Mamba?

What is there to "discretize"? Isn't the signal / sequence already "discrete" in the form of tokens? Please don't send me over to wikipedia article about "Discretization of linear state space models ", because I cannot draw any connection to LLMs. It seems to me that Mamba at its core is just EMA with dynamic alpha parameter that is calculated from...

Fri May 10, 2024 15:42
Pycaret unstable [D]

I have a forecasting application backed by pycaret, however suddenly at times pycaret based models raise an unkown exception and suddenly after a day or so it starts working . I am unable to understand error nor this exception as the exception says this exception should not have occured. On debugging on my local the same inputs are working fine. Does...

Fri May 10, 2024 15:42

Vytvorte si vlastný informačný kanál

Ste pripravení to vyskúšať?
Začnite 14-dňovú skúšobnú verziu, kreditná karta sa nevyžaduje.

Založiť účet