Notepad Using Python Code Flowchart

10 things I learned from burning myself out with AI coding agents

Like all AI models based on the Transformer architecture, the large language models (LLMs) that underpin today’s coding ...

I tried vibe coding an app as a beginner - here's what Cursor and Replit taught me

I tried four vibe-coding tools, including Cursor and Replit, with no coding background. Here's what worked (and what didn't).

GitHub

hifaz1012/AIStudio-LLM-Evaluation-Samples

Azure AI Studio offers 3 types of Large Language Model (LLM) Evaluations. Manual Evaluation: Manual review of LLM Responses by human reviewers and domain experts ...

IEEE

VulnGPT: Enhancing Source Code Vulnerability Detection Using AutoGPT and Adaptive Supervision Strategies

Abstract: In this paper, we present a novel approach to vulnerability detection in source code using a collaborative setup built on top of AutoGPT, with a controller and an evaluator AI working ...

IEEE

Multi-Label Code Error Classification Using CodeT5 and ML-KNN

Abstract: Programming is an essential skill in computer science and in a wide range of engineering-related disciplines. However, occurring errors, often referred to as “bugs” in code, can indeed be ...

GitHub

DeepCode: Open Agentic Coding

DeepCode achieves 75.9% on the 3-paper human evaluation subset, surpassing the best-of-3 human expert baseline (72.4%) by +3.5 percentage points. This demonstrates that our framework not only matches ...
