Based on what you describe, this does not appear to be a software issue, but rather a model specification and identification issue.
In particular:
• Error component and ASV models require a proper normalization to be identified. This is discussed in detail in Joan Walker’s PhD thesis. If the model is not properly identified, different parameterizations or different sets of simulation draws can lead
to very different numerical values of the parameters, even though the underlying model behavior is equivalent. In such cases, the log-likelihood value is the meaningful quantity to compare, and it should be very similar across implementations.
• Using only 500 draws is likely insufficient, especially for error component models. With too few draws, simulation noise can mask identification problems, making the estimates appear unstable or inconsistent across software. Increasing the number of draws
is often necessary not only for accuracy, but also to reveal whether the model is truly identified.
As a next step, I would strongly recommend trying Bayesian estimation, which is available in BIOGEME 3.3.2. Bayesian estimation is often very helpful to diagnose identification issues, as lack of identification typically manifests itself through poorly behaved
or non-informative posterior distributions.