Tags: python, video technology, numeric parsing, ffmpeg examples, ffmpeg, hybase, programming details
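Below is the log that ffmpeg wrote to a file during a transcode. When analyzing transcode results, values such as PSNR and average bitrate are among the most commonly used metrics. To extract them quickly, this article provides a Python example; see the code that follows the log.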
[email protected] http://blog.csdn.net/zymill
================== start ====================
ffmpeg version 4.4 Copyright (c) 2000-2021 the FFmpeg developers
built with gcc 4.8.5 (GCC) 20150623 (Red Hat 4.8.5-11)
configuration: --enable-shared --enable-static --enable-version3 --enable-gpl --enable-nonfree --enable-libfdk-aac --enable-libmp3lame --enable-libx264 --enable-libx265 --enable-libxvid --enable-libdavs2 --enable-libxavs2 --enable-libdav1d --enable-libuavs3d --enable-cuda --enable-libnpp --enable-cuvid --enable-ffnvcodec --enable-nvenc --enable-nvdec --enable-zlib --enable-libfribidi --enable-libfreetype --enable-libxml2 --enable-fontconfig --enable-libass --enable-frei0r --enable-opengl --enable-filter=gltransition --extra-libs='-lGLEW -lglfw3' --extra-cflags=-I/usr/local/cuda/include --extra-ldflags=-L/usr/local/cuda/lib64
libavutil 56. 70.100 / 56. 70.100
libavcodec 58.134.100 / 58.134.100
libavformat 58. 76.100 / 58. 76.100
libavdevice 58. 13.100 / 58. 13.100
libavfilter 7.110.100 / 7.110.100
libswscale 5. 9.100 / 5. 9.100
libswresample 3. 9.100 / 3. 9.100
libpostproc 55. 9.100 / 55. 9.100
Input #0, mpegts, from '/home/ffmpeg-4.4.x/ai_trc_shfiles/nobody_hls/nobody_00001.ts':
Duration: 00:00:04.67, start: 6.822000, bitrate: 4077 kb/s
Program 1
Metadata:
service_name : hys Dec 22 2021 17:52:49_s01
service_provider: hys
Stream #0:0[0x100]: Video: h264 (High) ([27][0][0][0] / 0x001B), yuv420p(progressive), 1920x1080 [SAR 1:1 DAR 16:9], 23.98 fps, 23.98 tbr, 90k tbn, 47.95 tbc
Stream mapping:
Stream #0:0 -> #0:0 (h264 (native) -> h264 (libx264))
Press [q] to stop, [?] for help
[libx264 @ 0x15ad340] --psnr used with psy on: results will be invalid!
[libx264 @ 0x15ad340] --tune psnr should be used if attempting to benchmark psnr!
[libx264 @ 0x15ad340] using SAR=1/1
[libx264 @ 0x15ad340] using cpu capabilities: MMX2 SSE2Fast SSSE3 SSE4.2 AVX FMA3 BMI2 AVX2 AVX512
[libx264 @ 0x15ad340] profile High, level 4.0, 4:2:0, 8-bit
[mpegts @ 0x1591380] [mpegts_init] init first_pcr=22950000 (90khz:76500, ms:850), timestamp mode: copyts=-1
[mpegts @ 0x1591380] [mpegts_init] cbr mode: mux_rate=0 bps, pcr_period=33 ms pat_period=80 ms
[mpegts @ 0x1591380] [mpegts_init] ts->copyts= -1, first_pcr= 22950000, last_pcr=22950000, max_delay= 850000us, period ms: pat=80 sdt=1200 pcr=33, mpegts_flags: hys_mux= 0 no_keyframe_pcr= 0 av_same_mux_delay= 0
Output #0, mpegts, to '/home/ffmpeg-4.4.x/ai_trc_shfiles/nobody_trc/trc_nobody_00001.ts':
Metadata:
encoder : Lavf58.76.100
Stream #0:0: Video: h264, yuv420p(progressive), 1920x1080 [SAR 1:1 DAR 16:9], q=2-31, 23.98 fps, 90k tbn
Metadata:
encoder : Lavc58.134.100 libx264
Side data:
cpb: bitrate max/min/avg: 0/0/0 buffer size: 0 vbv_delay: N/A
frame= 1 fps=0.0 q=0.0 size= 0kB time=00:00:00.00 bitrate=N/A speed= 0x
frame= 57 fps= 24 q=0.0 size= 0kB time=00:00:00.00 bitrate=N/A speed= 0x
frame= 69 fps= 24 q=0.0 size= 0kB time=00:00:00.00 bitrate=N/A speed= 0x
frame= 80 fps= 23 q=28.0 size= 256kB time=00:00:00.33 bitrate=6285.0kbits/s speed=0.0979x
frame= 92 fps= 23 q=28.0 size= 512kB time=00:00:00.83 bitrate=5028.1kbits/s speed=0.211x
frame= 103 fps= 23 q=28.0 size= 512kB time=00:00:01.29 bitrate=3243.9kbits/s speed=0.286x
frame= 112 fps= 22 q=28.0 size= 768kB time=00:00:01.66 bitrate=3771.1kbits/s speed=0.331x
frame= 112 fps= 17 q=-1.0 Lsize= 2577kB time=00:00:04.58 bitrate=4601.6kbits/s speed=0.678x
video:2487kB audio:0kB subtitle:0kB other streams:0kB global headers:0kB muxing overhead: 3.603857%
[libx264 @ 0x15ad340] frame I:3 Avg QP:18.16 size: 59277 PSNR Mean Y:48.36 U:52.91 V:52.56 Avg:49.38 Global:49.30
[libx264 @ 0x15ad340] frame P:51 Avg QP:22.12 size: 32512 PSNR Mean Y:46.27 U:51.56 V:52.01 Avg:47.44 Global:47.08
[libx264 @ 0x15ad340] frame B:58 Avg QP:25.41 size: 12263 PSNR Mean Y:44.93 U:50.65 V:51.14 Avg:46.16 Global:45.98
[libx264 @ 0x15ad340] consecutive B-frames: 7.1% 67.9% 10.7% 14.3% 0.0%
[libx264 @ 0x15ad340] mb I I16..4: 34.1% 59.9% 6.0%
[libx264 @ 0x15ad340] mb P I16..4: 7.8% 24.6% 1.8% P16..4: 22.2% 5.5% 3.0% 0.0% 0.0% skip:35.0%
[libx264 @ 0x15ad340] mb B I16..4: 2.4% 4.9% 0.2% B16..8: 26.3% 3.7% 0.6% direct: 3.2% skip:58.7% L0:47.5% L1:45.0% BI: 7.5%
[libx264 @ 0x15ad340] 8x8 transform intra:69.3% inter:75.8%
[libx264 @ 0x15ad340] direct mvs spatial:81.0% temporal:19.0%
[libx264 @ 0x15ad340] coded y,uvDC,uvAC intra: 42.2% 51.3% 8.0% inter: 11.6% 13.8% 0.1%
[libx264 @ 0x15ad340] i16 v,h,dc,p: 46% 15% 11% 28%
[libx264 @ 0x15ad340] i8 v,h,dc,ddl,ddr,vr,hd,vl,hu: 38% 13% 14% 4% 6% 8% 6% 7% 5%
[libx264 @ 0x15ad340] i4 v,h,dc,ddl,ddr,vr,hd,vl,hu: 32% 14% 5% 5% 10% 11% 8% 8% 7%
[libx264 @ 0x15ad340] i8c dc,h,v,p: 44% 18% 31% 8%
[libx264 @ 0x15ad340] Weighted P-Frames: Y:9.8% UV:3.9%
[libx264 @ 0x15ad340] ref P L0: 84.6% 11.6% 2.8% 1.0%
[libx264 @ 0x15ad340] ref B L0: 91.3% 7.2% 1.6%
[libx264 @ 0x15ad340] SSIM Mean Y:0.9919295 (20.931db)
[libx264 @ 0x15ad340] PSNR Mean Y:45.632 U:51.126 V:51.573 Avg:46.828 Global:46.513 kb/s:4362.20
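In the log above, the per-frame-type lines (frame I, frame P, frame B) also contain the text "PSNR Mean", but only the final libx264 summary line additionally carries "kb/s:". Checking for both markers is therefore what singles out the summary line. A minimal sketch of that selection, assuming the log has already been read into a list of strings (the helper name `find_summary_line` is hypothetical and only used for illustration):

```python
def find_summary_line(lines):
    """Return the last line containing both 'PSNR Mean' and 'kb/s:', or None."""
    summary = None
    for line in lines:
        # frame I/P/B lines have 'PSNR Mean' but no 'kb/s:' field
        if "PSNR Mean" in line and "kb/s:" in line:
            summary = line.rstrip("\n")
    return summary


# Three lines copied from the sample log
sample = [
    "[libx264 @ 0x15ad340] frame I:3 Avg QP:18.16 size: 59277 PSNR Mean Y:48.36 U:52.91 V:52.56 Avg:49.38 Global:49.30",
    "[libx264 @ 0x15ad340] SSIM Mean Y:0.9919295 (20.931db)",
    "[libx264 @ 0x15ad340] PSNR Mean Y:45.632 U:51.126 V:51.573 Avg:46.828 Global:46.513 kb/s:4362.20",
]
print(find_summary_line(sample))
```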
The following Python code parses the last line of the log above to obtain the PSNR and bitrate values. It is provided for reference.
#/****************************************************************************
#* mark_trc.py: mark for x264 encode base on PSNR and bitrate
# python 3.x
#*****************************************************************************
#* Copyright (c) 2021-2022 [email protected]
#* http://blog.csdn.net/zymill
#* http://github.com/zymill
#*****************************************************************************
#* Last update Date: 2021-12-30 19:30:25 version: 0.0.1
#*****************************************************************************
#*/
import os
import re
import sys
import logging
import linecache
from logging import handlers
################################################################################
# Constant parameters
trc_target_avg_kbps_sd = 1500 # 720x576
trc_target_avg_kbps_hd = 2000 # 1280x720
trc_target_avg_kbps_uhd = 3000 # 1920x1080
trc_target_avg_kbps_4k = 10000 # 3840x2160
sd_r_list = []
hd_r_list = []
uhd_r_list = []
u4k_r_list = []
score_list = []
################################ logger ######################################
class Logger(object):
    # map level names to logging constants
    level_relations = {
        'debug': logging.DEBUG,
        'info': logging.INFO,
        'warning': logging.WARNING,
        'error': logging.ERROR,
        'crit': logging.CRITICAL
    }
    def __init__(self, filename, level='info', when='D', backCount=3, fmt='%(asctime)s - %(pathname)s[line:%(lineno)d] - [%(levelname)s] %(message)s'):
        self.logger = logging.getLogger(filename)
        format_str = logging.Formatter(fmt)
        self.logger.setLevel(self.level_relations.get(level))
        # console output
        sh = logging.StreamHandler()
        sh.setFormatter(format_str)
        # file output, rotated on a timer; backCount old files are kept
        th = handlers.TimedRotatingFileHandler(filename=filename, when=when, backupCount=backCount, encoding='utf-8')
        th.setFormatter(format_str)
        self.logger.addHandler(sh)
        self.logger.addHandler(th)
################################################################################
# Methods check dir/file
def traverseDir(root_path, file_list, dir_list):
    # recursively collect every file and sub-directory under root_path
    dir_or_files = os.listdir(root_path)
    for dir_file in dir_or_files:
        dir_file_path = os.path.join(root_path, dir_file)
        if os.path.isdir(dir_file_path):
            dir_list.append(dir_file_path)
            traverseDir(dir_file_path, file_list, dir_list)
        else:
            file_list.append(dir_file_path)
#################################################################################
# check dst_path, create it if not found
def createDirIfNotFound(log, dst_path):
    if not os.path.exists(dst_path):
        os.makedirs(dst_path)
        log.logger.info('created ' + dst_path)
        return
    log.logger.info('found ' + dst_path)
    return
#################################################################################
# parse float value from line
def parseKeyValueFromLine(log, line, pattern):
    # return the first match of pattern as a float, or 0.0 if nothing matches
    val = 0.0
    result_lst = pattern.findall(line)
    log.logger.info(result_lst)
    if (len(result_lst) > 0):
        val = float(result_lst[0])
    return val
#################################################################################
# parse video resolution
# Stream #0:0: Video: h264, yuv420p(progressive), 1920x1080 [SAR 1:1 DAR 16:9], q=2-31, 23.98 fps, 90k tbn
def parsetVideoResolution(log, line):
    global video_width
    global video_height
    # default value
    video_width = 0
    video_height = 0
    # the resolution is the last comma-separated field before "[SAR"
    end = line.find("[SAR")
    if (end == -1):
        return
    sub_str = line[0 : end-1]
    start = sub_str.rfind(",")
    if (start == -1):
        return
    sub_str = sub_str[start + 1 : ]
    log.logger.info('video resolution str: ' + sub_str)
    result_list = sub_str.split('x')
    if (len(result_list) == 2):
        video_width = int(result_list[0])
        video_height = int(result_list[1])
    return
#################################################################################
# parse PSNR and bitrate
# [libx264 @ 0x2503340 ] PSNR Mean Y:59.691 U:62.357 V:62.717 Avg:60.439 Global:59.903 kb/s:44056.39
def parsePSNRBitRate(log, line):
    global y_psnr
    global u_psnr
    global v_psnr
    global avg_psnr
    global global_psnr
    global avg_bitrate
    # default value
    y_psnr = 0.0
    u_psnr = 0.0
    v_psnr = 0.0
    avg_psnr = 0.0
    global_psnr = 0.0
    avg_bitrate = 0.0
    log.logger.debug('psnr_bitrate line: ' + line)
    # only the summary line has both markers; frame I/P/B lines lack "kb/s:"
    if ((-1 == line.find("PSNR Mean")) or (-1 == line.find("kb/s:"))):
        return
    y_psnr = parseKeyValueFromLine(log, line, re.compile(r'(?<=Y:)\d+\.?\d*'))
    u_psnr = parseKeyValueFromLine(log, line, re.compile(r'(?<=U:)\d+\.?\d*'))
    v_psnr = parseKeyValueFromLine(log, line, re.compile(r'(?<=V:)\d+\.?\d*'))
    avg_psnr = parseKeyValueFromLine(log, line, re.compile(r'(?<=Avg:)\d+\.?\d*'))
    global_psnr = parseKeyValueFromLine(log, line, re.compile(r'(?<=Global:)\d+\.?\d*'))
    avg_bitrate = parseKeyValueFromLine(log, line, re.compile(r'(?<=kb/s:)\d+\.?\d*'))
    return
#################################################################################
# [libx264 @ 0x2503340 ] SSIM Mean Y:0.9995006 (33.016db)
def parseSSIM(log, line):
    global avg_ssim
    avg_ssim = 0.0
    log.logger.info('line: ' + line)
    # the log prints "SSIM Mean Y:" with a single colon
    avg_ssim = parseKeyValueFromLine(log, line, re.compile(r'(?<=SSIM Mean Y:)\d+\.?\d*'))
    return
#################################################################################
# process one log file
def parseOneLogFile(log, file_name):
    if not os.path.exists(file_name):
        log.logger.error('file not existed ' + file_name)
        return
    log.logger.info('start to parse results from file: ' + file_name)
    global video_width
    global video_height
    global y_psnr
    global u_psnr
    global v_psnr
    global avg_psnr
    global global_psnr
    global avg_bitrate
    global avg_ssim
    # default value
    video_width = 0
    video_height = 0
    y_psnr = 0.0
    u_psnr = 0.0
    v_psnr = 0.0
    avg_psnr = 0.0
    global_psnr = 0.0
    avg_bitrate = 0.0
    avg_ssim = 0.0
    # the "rU" mode is rejected by Python 3.11+; plain "r" already uses universal newlines
    with open(file_name, "r") as log_file:
        for line in log_file:
            if ((-1 != line.find("Stream")) and
                (-1 != line.find("Video")) and
                (-1 != line.find("[SAR"))):
                parsetVideoResolution(log, line)
            if ((-1 != line.find("PSNR Mean")) and
                (-1 != line.find("kb/s:"))):
                parsePSNRBitRate(log, line)
            if (-1 != line.find("SSIM Mean")):
                parseSSIM(log, line)
    log.logger.info('##########################################################################')
    log.logger.info(' video resolution: %d x %d', video_width, video_height)
    log.logger.info(' y_psnr : %f', y_psnr)
    log.logger.info(' u_psnr : %f', u_psnr)
    log.logger.info(' v_psnr : %f', v_psnr)
    log.logger.info(' avg_psnr : %f', avg_psnr)
    log.logger.info(' global_psnr : %f', global_psnr)
    log.logger.info(' avg_bitrate : %f', avg_bitrate)
    log.logger.info(' avg_ssim : %f', avg_ssim)
    log.logger.info('##########################################################################')
    return
################################################################################
# Constant parameters
trc_target_avg_kbps_sd = 1500 # 720x576
trc_target_avg_kbps_hd = 2000 # 1280x720
trc_target_avg_kbps_uhd = 3000 # 1920x1080
trc_target_avg_kbps_4k = 10000 # 3840x2160
trc_log_dir_windows = "g:/ai_trc_shfiles/nobody_trc"
trc_log_dir_linux = ""
trc_log_dir = trc_log_dir_windows
# global variables
video_width = 0
video_height = 0
y_psnr = 0.0
u_psnr = 0.0
v_psnr = 0.0
avg_psnr = 0.0
global_psnr = 0.0
avg_bitrate = 0.0
avg_ssim = 0.0 # not used for now; SSIM and PSNR are two independent evaluation methods
#################################################################################
# main process
if __name__ == "__main__":
    log = Logger('sys_all.log', level='debug')
    Logger('sys_err.log', level='error')
    log.logger.info("========= program start ============")
    ## collect the list of log files
    root_path = trc_log_dir
    # list of file paths
    file_list = []
    log_file_list = []
    # list of directory paths
    dir_list = []
    traverseDir(root_path, file_list, dir_list)
    ## show directory info
    log.logger.info('>>>')
    log.logger.info('found %d dir', len(dir_list))
    for dir in dir_list:
        log.logger.info(dir)
    ## filter log file BY ".log"
    for file in file_list:
        if (-1 != file.rfind(".log")): ## valid transcode log file
            log.logger.debug(file)
            log_file_list.append(file)
    # show log file info
    log.logger.info('>>>')
    log.logger.info('found %d log files (in %d files)', len(log_file_list), len(file_list))
    for file in log_file_list:
        ## parse the video resolution, PSNR, bitrate, etc. from the log
        parseOneLogFile(log, file)
    ## end of all cases
    log.logger.info("=== end ===")
    log.logger.info("============ program exit ============")
    sys.exit(0)
################################################################################
# end
#
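As an alternative to module-level globals, the libx264 summary line can also be parsed with a single regular expression that uses named groups and returns a dictionary. The following is only a sketch: `SUMMARY_RE` and `parse_summary` are hypothetical names, and the pattern assumes the field order seen in the sample log (Y, U, V, Avg, Global, kb/s).

```python
import re

# One pattern for the whole summary line; each metric is a named group.
# Assumption: fields appear in the order shown in the sample log.
NUM = r"(\d+(?:\.\d+)?)"
SUMMARY_RE = re.compile(
    r"PSNR Mean\s+Y:(?P<y_psnr>\d+(?:\.\d+)?)"
    r"\s+U:(?P<u_psnr>\d+(?:\.\d+)?)"
    r"\s+V:(?P<v_psnr>\d+(?:\.\d+)?)"
    r"\s+Avg:(?P<avg_psnr>\d+(?:\.\d+)?)"
    r"\s+Global:(?P<global_psnr>\d+(?:\.\d+)?)"
    r"\s+kb/s:(?P<avg_bitrate>\d+(?:\.\d+)?)"
)


def parse_summary(line):
    """Return a dict of floats for the summary line, or None if it does not match."""
    match = SUMMARY_RE.search(line)
    if match is None:
        return None
    return {name: float(value) for name, value in match.groupdict().items()}


# Summary line copied from the sample log
line = ("[libx264 @ 0x15ad340] PSNR Mean Y:45.632 U:51.126 V:51.573 "
        "Avg:46.828 Global:46.513 kb/s:4362.20")
result = parse_summary(line)
print(result["avg_psnr"], result["avg_bitrate"])
```

Because the pattern requires the trailing `kb/s:` field, the per-frame-type lines (frame I, P, B) do not match and yield `None`, so no separate `find` check is needed before parsing.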