Hi Niko,
In general this will depend on the method that you use to select stimulus intensities for the 2AFC task. If, for example, you decide to measure the full psychometric function by testing a wide range of offsets (say, 0 to 100 deg), then you will likely require several hundred trials for accurate measurements, whereas you could make do with less using the continuous-report paradigm. However, there are fancier stimulus placement strategies (e.g., adaptive staircases procedures such as QUEST or FAST) that may improve the efficiency of the testing procedure quite a bit, allowing you to place each trial at the most informative intensity and thereby avoiding less informative regions of the curve (say, anything past 40 deg with set size 1 or 2).
If you'd like a quantitative answer to your question, take a look at MemTests/TestSamplingAndFitting, which simulates data from a model using parameters of your choosing and then attempts to recover those parameters using MemFit.
Hope this helps,
Jordan