I have a question about posterior inference in Bayesian statistics.
In Bayesian inference, suppose we are given a model p(x|\theta) and a prior distribution p(\theta), and we observe the dataset D = {x_1, x_2, ..., x_N}. The goal is to estimate the usually intractable posterior p(\theta|D).
Sometimes I see people choose to evaluate the joint p(\theta, D) instead, because it is proportional to the posterior: p(\theta|D) = p(\theta, D)/p(D). What is the reason behind this? Isn't p(D) hard to evaluate? Thank you for any advice.
You want to maximise p(θ|D) by finding the optimal parameters θ. Since p(D) > 0 and it does not depend on θ, multiplying the objective by p(D) does not change the maximiser, so you can rewrite it as p(θ|D) p(D) = p(θ, D) and never evaluate p(D) at all. In readable mathematical notation:

argmax_θ p(θ|D) = argmax_θ p(θ|D) p(D) = argmax_θ p(θ, D) = argmax_θ p(D|θ) p(θ)

So maximising the posterior is the same as maximising the joint, which factorises into a likelihood and a prior that you can evaluate directly.
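To make this concrete, here is a minimal sketch in Python (a toy conjugate Gaussian model of my own choosing, not from your question): it finds the MAP estimate by maximising log p(θ, D) = log p(θ) + log p(D|θ), with p(D) never appearing, and checks the result against the closed-form posterior mode.

```python
import numpy as np
from scipy.optimize import minimize_scalar

# Toy model (assumed for illustration): x_i ~ Normal(theta, sigma^2)
# with known sigma, and prior theta ~ Normal(mu0, tau0^2).
rng = np.random.default_rng(0)
sigma, mu0, tau0 = 1.0, 0.0, 2.0
D = rng.normal(loc=1.5, scale=sigma, size=50)  # observed data

def log_joint(theta):
    # log p(theta, D) = log p(theta) + sum_i log p(x_i | theta),
    # dropping additive constants (including log p(D)), which do
    # not depend on theta and so cannot move the argmax.
    log_prior = -0.5 * (theta - mu0) ** 2 / tau0 ** 2
    log_lik = -0.5 * np.sum((D - theta) ** 2) / sigma ** 2
    return log_prior + log_lik

# Maximise the log joint; p(D) is never computed.
res = minimize_scalar(lambda t: -log_joint(t))

# Closed-form posterior mode for this conjugate model, as a sanity check.
post_var = 1.0 / (1.0 / tau0 ** 2 + len(D) / sigma ** 2)
post_mode = post_var * (mu0 / tau0 ** 2 + D.sum() / sigma ** 2)
print(res.x, post_mode)  # the two estimates agree
```

The same reasoning is why sampling and approximation methods get away without p(D): Metropolis-Hastings only ever uses ratios of the target density, in which p(D) cancels, and variational inference optimises a bound built from log p(θ, D), so knowing the posterior up to the constant p(D) is enough.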