In this paper we present ways to parallelize the Deblocking Filter (DF) of the H.264 video codec and report results obtained on the Decoupled Threaded Architecture (DTA). We exploited all the available parallelism in the code in order to make it suitable for DTA. Experimental results show that a significant speedup can be achieved and that DTA can efficiently exploit the available parallelism.