Grouper and snapper fish are widely consumed species with high economic value in the global market. To determine their economic value, identifying the species and estimating the weight are essential in the pricing and quality determination of the traded fish. The commonly used manual methods are often time-consuming and labor-intensive. Based on this, a more effective computer-based method is needed for these repetitive tasks. This research aims to analyze the performance of two commonly used deep learning models, YOLO and Faster R-CNN, in detecting species and estimating the weight of specific grouper and snapper fish. The data used consisted of 2991 samples divided into 18 classes. This data was then augmented using rotate and flip features to create 6843 image samples. A threshold of 0.8 was used in the detection process, meaning objects detected with confidence below 0.8 would be ignored. Once trained, the performance of both models was tested using precision, recall, and accuracy parameters to assess how accurately the models predicted fish species from the input data and Mean Absolute Percentage Error (MAPE) to evaluate the estimation results of the models. There were differences in the quantitative evaluation results between the YOLO and Faster R-CNN models. The YOLO model achieved precision, recall, and accuracy rates of 0.98, 0.98, and 0.96, respectively, while the Faster R-CNN model had precision, recall, and accuracy rates of 0.97, 0.98, and 0.95, respectively. Additionally, the MAPE for weight estimation was 2.42% for image data and 3.66% for video data for the YOLO model. In contrast, for the Faster R-CNN model, the results were 14.62% for image data and 13.59% for video data. Thus, it can be concluded that the YOLO model provides better quantitative evaluation results compared to the Faster R-CNN model.